Abstract. We show that for each $\beta > 0$, every digraph $G$ of sufficiently large order $n$ whose
outdegree and indegree sequences $d_1^+ \le \dots \le d_n^+$ and $d_1^- \le \dots \le d_n^-$ satisfy
$d_i^+, d_i^- \ge \min\{i + \beta n, n/2\}$ is Hamiltonian. In fact, we can weaken these assumptions to

(i) $d_i^+ \ge \min\{i + \beta n, n/2\}$ or $d^-_{n-i-\beta n} \ge n - i$;

(ii) $d_i^- \ge \min\{i + \beta n, n/2\}$ or $d^+_{n-i-\beta n} \ge n - i$;

and still deduce that $G$ is Hamiltonian. This provides an approximate version of a conjecture
of Nash-Williams from 1975 and improves a previous result of Kühn, Osthus and Treglown.



1. Introduction

The decision problem of whether a graph contains a Hamilton cycle is one of the most
famous NP-complete problems, and so it is unlikely that there exists a good characterization
of all Hamiltonian graphs. For this reason, it is natural to ask for sufficient conditions which
ensure Hamiltonicity. The most basic result of this kind is Dirac's theorem [6], which states
that every graph of order $n \ge 3$ and minimum degree at least $n/2$ is Hamiltonian.

Dirac's theorem was followed by a series of results by various authors giving even weaker
conditions which still guarantee Hamiltonicity. An appealing example is a theorem of Pósa [20]
which implies that every graph of order $n \ge 3$ whose degree sequence $d_1 \le d_2 \le \dots \le d_n$
satisfies $d_i \ge i + 1$ for all $i < n/2$ is Hamiltonian. Finally, Chvátal showed that if the
degree sequence of a graph $G$ satisfies $d_i \ge i + 1$ or $d_{n-i} \ge n - i$ whenever $i < n/2$, then $G$
is Hamiltonian. Chvátal's condition is best possible in the sense that for every sequence not
satisfying this condition, there is a non-Hamiltonian graph whose degree sequence majorises the
given sequence.
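
To make the two degree-sequence conditions above concrete, the following Python sketch tests them on a given list of degrees. The function names satisfies_posa and satisfies_chvatal are ours and purely illustrative; the sketch only checks the inequalities as stated above and does not verify that the input is the degree sequence of an actual graph.

```python
def satisfies_posa(degrees):
    """Posa-type condition from the text: d_i >= i + 1 for all i < n/2."""
    d = sorted(degrees)                      # d[i - 1] plays the role of d_i
    n = len(d)
    return all(d[i - 1] >= i + 1 for i in range(1, n) if 2 * i < n)


def satisfies_chvatal(degrees):
    """Chvatal's condition: d_i >= i + 1 or d_{n-i} >= n - i whenever i < n/2."""
    d = sorted(degrees)
    n = len(d)
    return all(
        d[i - 1] >= i + 1 or d[n - i - 1] >= n - i
        for i in range(1, n)
        if 2 * i < n
    )
```

For instance, the sequence 2, 2, 4, 4, 4, 4 satisfies Chvátal's condition but not the Pósa-type condition, while the degree sequence 2, 2, 2, 2, 2 of a 5-cycle satisfies neither, which illustrates that these conditions are sufficient but not necessary.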

It is natural to seek analogues of these theorems for digraphs. For basic terminology on
digraphs, we refer the reader to the monograph of Bang-Jensen and Gutin [2]. Ghouila-Houri [8]
proved that every digraph of order $n$ and minimum indegree and outdegree at least $n/2$ is
Hamiltonian, thus providing such an analogue of Dirac's theorem for digraphs. Thomassen [21]
asked the corresponding question for oriented graphs (digraphs with no 2-cycles). One might
expect that a weaker minimum semidegree (i.e. indegree and outdegree) condition would suffice
in this case. Häggkvist [9] gave a construction showing that a minimum semidegree of $\frac{3n-4}{8}$
is necessary and conjectured that it is also sufficient to guarantee a Hamilton cycle in any
oriented graph of order $n$. This conjecture was recently proved in [11], following an asymptotic
solution in [12]. In [5] we gave an NC algorithm for finding Hamilton cycles in digraphs with a
certain robust expansion property which captures several previously known criteria for finding
Hamilton cycles. These and other results are also discussed in the recent survey [17].

However, no digraph analogue of Chvátal's theorem is known. For a digraph $G$ of order $n$,
let us write $d_1^+(G) \le \dots \le d_n^+(G)$ for its outdegree sequence, and $d_1^-(G) \le \dots \le d_n^-(G)$ for
its indegree sequence. We will usually write $d_i^+$ and $d_i^-$ instead of $d_i^+(G)$ and $d_i^-(G)$ if this is
unambiguous.

The following conjecture of Nash-Williams [19] would provide such an analogue.

Conjecture 1. Let $G$ be a strongly connected digraph of order $n \ge 3$ and suppose that for all
$i < n/2$

(i) $d_i^+ \ge i + 1$ or $d^-_{n-i} \ge n - i$;

(ii) $d_i^- \ge i + 1$ or $d^+_{n-i} \ge n - i$.

Then $G$ contains a Hamilton cycle.

Nash-Williams also highlighted the following conjectural analogue of Pósa's theorem, which
would follow from Conjecture 1.

Conjecture 2. Let $G$ be a digraph of order $n \ge 3$ such that $d_i^+, d_i^- \ge i + 1$ for all $i < (n-1)/2$
and $d^+_{\lceil n/2 \rceil}, d^-_{\lceil n/2 \rceil} \ge \lceil n/2 \rceil$. Then $G$ contains a Hamilton cycle.

Note that in Conjecture 2 the degree condition implies that $G$ is strongly connected. It is
not even known whether the above conditions guarantee the existence of a cycle through any
given pair of vertices (see [3]). We will prove the following semi-exact form of Conjecture 2. It
is 'semi-exact' in the sense that for half of the vertex degrees, we obtain the conjectured bound,
whereas for the other half, we need an additional error term.

Theorem 3. For every $\beta > 0$ there exists an integer $n_0 = n_0(\beta)$ such that the following holds.
Suppose $G$ is a digraph on $n \ge n_0$ vertices such that $d_i^+, d_i^- \ge \min\{i + \beta n, n/2\}$ whenever
$i < n/2$. Then $G$ contains a Hamilton cycle.

Recently, the following approximate version of Conjecture 1 for large digraphs was proved by
Kühn, Osthus and Treglown [18].

Theorem 4. For every $\beta > 0$ there exists an integer $n_0 = n_0(\beta)$ such that the following holds.
Suppose $G$ is a digraph on $n \ge n_0$ vertices such that for all $i < n/2$

(i) $d_i^+ \ge i + \beta n$ or $d^-_{n-i-\beta n} \ge n - i$;

(ii) $d_i^- \ge i + \beta n$ or $d^+_{n-i-\beta n} \ge n - i$.

Then $G$ contains a Hamilton cycle.

We will extend this to the following theorem, which implies Theorem 3.

Theorem 5. For every $\beta > 0$ there exists an integer $n_0 = n_0(\beta)$ such that the following holds.
Suppose $G$ is a digraph on $n \ge n_0$ vertices such that for all $i < n/2$

(i) $d_i^+ \ge \min\{i + \beta n, n/2\}$ or $d^-_{n-i-\beta n} \ge n - i$;

(ii) $d_i^- \ge \min\{i + \beta n, n/2\}$ or $d^+_{n-i-\beta n} \ge n - i$.

Then $G$ contains a Hamilton cycle.

(For the purposes of our arguments it turns out that there is no significant difference in the
use of the assumptions, so for simplicity the reader could just read our proof as it applies to
Theorem 3.)
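
As an illustration of the hypothesis of Theorem 5, the following Python sketch checks conditions (i) and (ii) for given outdegree and indegree sequences. It is only a sketch under simplifying assumptions: the function name satisfies_capped_condition is ours, and the parameter bn stands for $\beta n$, which we assume to be an integer with $0 \le \beta n < n/2$ so that all indices are well defined. Note that the theorem itself only applies for $n \ge n_0(\beta)$, so the check says nothing about Hamiltonicity of small digraphs.

```python
def satisfies_capped_condition(out_degrees, in_degrees, bn):
    """Check (i) and (ii) of Theorem 5 for all i < n/2, with bn standing for beta*n.

    Assumption of this sketch: bn is an integer with 0 <= bn < n/2.
    """
    n = len(out_degrees)
    if len(in_degrees) != n or not 0 <= bn < n / 2:
        raise ValueError("need equally long sequences and 0 <= bn < n/2")
    d_out, d_in = sorted(out_degrees), sorted(in_degrees)

    def one_sided(first, second):
        # first[i - 1] is d_i for one sign, second[j - 1] is d_j for the other sign
        for i in range(1, n):
            if 2 * i >= n:
                break
            if first[i - 1] >= min(i + bn, n / 2):
                continue
            # the index n - i - bn is at least 1 here because i, bn < n/2
            if second[n - i - bn - 1] >= n - i:
                continue
            return False
        return True

    return one_sided(d_out, d_in) and one_sided(d_in, d_out)
```

For example, the complete digraph on 10 vertices passes the check with bn = 2, whereas the directed cycle on 10 vertices (all indegrees and outdegrees equal to 1) fails it with bn = 1.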

The improvement in the degree condition may at first appear minor, so we should stress
that capping the degrees at $n/2$ makes the problem substantially more difficult, and we need
to develop several new techniques in our solution. This point cannot be fully explained until
we have given several definitions, but for the expert reader we make the following comment.
Speaking very roughly, the general idea used in [12, 11, 5] is to apply Szemerédi's Regularity
Lemma, cover most of the reduced digraph by directed cycles, and then use the expansion
property guaranteed by the degree conditions on $G$ to link these cycles up into a Hamilton cycle
while absorbing any exceptional vertices. When the degrees are capped at $n/2$ two additional
difficulties arise: (i) the expansion property is no longer sufficient to link up the cycles, and (ii)
a previously used technique for reducing the size of the exceptional set fails. Our techniques
for circumventing these difficulties seem instructive and potentially useful in attacking
Conjectures 1 and 2 in full generality.

Our paper is organized as follows. The next section contains some notation and Section 3
some preliminary observations and examples. Our proof will use the machinery of Szemerédi's
Regularity Lemma, which we describe in Section 4. (Unlike [12, 11, 5], we do not require the
Blow-up Lemma.) Section 5 contains an overview of the proof in a special case that illustrates the
new methods that we introduce in this paper. The cycle covering result is proved in Section 6
and the proof of the special case completed in Section 7. In Section 8 we describe the structures
that arise in the general case. We establish some bounds for these structures in Section 9. Our
main theorem is proved in Section 10. The final section contains a concluding remark.

2. Notation

Given two vertices $x$ and $y$ of a digraph $G$, we write $xy$ for the edge directed from $x$ to $y$.
The order $|G|$ of $G$ is the number of its vertices. We write $N_G^+(x)$ and $N_G^-(x)$ for the
outneighbourhood and inneighbourhood of $x$ and $d_G^+(x)$ and $d_G^-(x)$ for its outdegree and indegree. The
degree of $x$ is $d_G(x) = d_G^+(x) + d_G^-(x)$. We usually drop the subscript $G$ if this is unambiguous.
The minimum degree and maximum degree of $G$ are defined to be $\delta(G) = \min\{d(x) : x \in V(G)\}$
and $\Delta(G) = \max\{d(x) : x \in V(G)\}$ respectively. We define the minimum indegree $\delta^-(G)$ and
minimum outdegree $\delta^+(G)$ similarly. The minimum semidegree is $\delta^0(G) = \min\{\delta^+(G), \delta^-(G)\}$.
Given $S \subseteq V(G)$ we write $d_S^+(x) = |N^+(x) \cap S|$ for the outdegree of $x$ in the set $S$. We define
$d_S^-(x)$ and $d_S(x)$ similarly. Given a set $A$ of vertices of $G$, we write $N_G^+(A)$ for the set of all
outneighbours of vertices of $A$, i.e. for the union of $N_G^+(x)$ over all $x \in A$. We define $N_G^-(A)$
analogously.

Given vertex sets $A$ and $B$ in a graph or digraph $G$, we write $E_G(A, B)$ for the set of all edges
$ab$ with $a \in A$ and $b \in B$ and put $e_G(A, B) = |E_G(A, B)|$. As usual, we drop the subscripts
when this is unambiguous. If $A \cap B = \emptyset$ we write $(A, B)_G$ for the bipartite subgraph of $G$
with vertex classes $A$ and $B$ whose set of edges is $E_G(A, B)$. The restriction $G[A]$ of $G$ to $A$
is the digraph with vertex set $A$ and edge set all those edges of $G$ with both endpoints in $A$.
We also write $G \setminus A$ for the digraph obtained by deleting $A$ and all edges incident to it, i.e.
$G \setminus A = G[V(G) \setminus A]$.

Cycles and paths will always be understood as directed cycles and directed paths, even if this
is not explicitly stated. Given two vertices $x$ and $y$ on a directed cycle $C$ we write $xCy$ for the
subpath of $C$ from $x$ to $y$. Similarly, given two vertices $x$ and $y$ on a directed path $P$ such that
$x$ precedes $y$, we write $xPy$ for the subpath of $P$ from $x$ to $y$. A walk of length $\ell$ in a digraph
$G$ is a sequence $v_0, v_1, \dots, v_\ell$ of vertices of $G$ such that $v_i v_{i+1} \in E(G)$ for all $0 \le i \le \ell - 1$.
The walk is closed if $v_0 = v_\ell$. A 1-factor of $G$ is a collection of disjoint cycles which cover all
vertices of $G$. Given a 1-factor $F$ of $G$ and a vertex $x$ of $G$, we write $x_F^+$ and $x_F^-$ for the successor
and predecessor of $x$ on the cycle in $F$ containing $x$. We usually drop the subscript $F$ if this is
unambiguous. We say that $x$ and $y$ are at distance $d$ on $F$ if they belong to the same directed
cycle $C$ in $F$ and the distance from $x$ to $y$ or from $y$ to $x$ on $C$ is $d$. Note in particular that
with this definition, $x$ and $y$ could be at distance $d$ and $d'$ on $F$ with $d \ne d'$.

A digraph G is strongly connected if for any ordered pair of vertices (x, y) there is a directed 
walk from x to y. A separator of G is a set S of vertices such that G\S is not strongly connected. 
We say G is strongly k-connected if |G| > k and if it has no separator of size less than k. By
Menger's theorem, this is equivalent to the property that for any ordered pair of vertices (x, y) 
there are k internally disjoint paths from x to y. 

We write $a = b \pm c$ to mean that the real numbers $a, b, c$ satisfy $|a - b| \le c$. We sometimes
also write an expression such as $d^\pm(x) \ge t$ to mean $d^+(x) \ge t$ and $d^-(x) \ge t$. The use of the $\pm$
sign will be clear from the context.

To avoid unnecessarily complicated calculations we will sometimes omit floor and ceiling signs
and treat large numbers as if they were integers.

3. Preliminaries 

In this section we record some simple consequences of our degree assumptions and describe
the examples showing that Conjectures 1 and 2 would be best possible. We also recall two
results on graph matchings and a standard large deviation inequality (the Chernoff bound).

Our degree assumptions are that for all $i < n/2$ we have

(i) $d_i^+ \ge \min\{i + \beta n, n/2\}$ or $d^-_{n-i-\beta n} \ge n - i$;

(ii) $d_i^- \ge \min\{i + \beta n, n/2\}$ or $d^+_{n-i-\beta n} \ge n - i$.

We claim that $\delta^+(G) = d_1^+ \ge \beta n$. For if this were false our assumptions would give
$d^-_{n-1-\beta n} \ge n - 1$, i.e. $G$ contains at least $\beta n + 1$ vertices of indegree $n - 1$. But a vertex of
indegree $n - 1$ is an outneighbour of all other vertices, so this also implies that $\delta^+(G) \ge \beta n$.
Similarly we have $\delta^-(G) \ge \beta n$.

To avoid complications with boundary cases it will be convenient to drop the condition
$i < n/2$. We note that this does not change our assumptions. For if $n/2 \le i < n - \beta n$ we can
apply our assumption (i) to $i' = n - i - \beta n$ and get $d^+_{i'} \ge \min\{i' + \beta n, n/2\}$ or $d^-_{n-i'-\beta n} \ge n - i'$,
i.e. $d^+_{n-i-\beta n} \ge n - i$ or $d_i^- \ge i + \beta n$, which implies assumption (ii) for $i$. Similarly assumption
(ii) for $i'$ implies assumption (i) for $i$. The assumptions do not make sense for $i \ge n - \beta n$, but
if we consider any statement about $d_j^\pm$ with $j \le 0$ as being vacuous (i.e. always true), then
we do not have to impose any conditions when $i \ge n - \beta n$.

For an extremal example for Conjectures 1 and 2 consider a digraph $G$ on $n$ vertices
constructed as follows. The vertex set is partitioned as $I \cup K$ with $|I| = k < n/2$ and $|K| = n - k$.
We make $I$ independent and $K$ complete. Then we pick a set $X$ of $k$ vertices of $K$ and add
all possible edges in both directions between $I$ and $X$. This gives a strongly connected
non-Hamiltonian digraph $G$ in which both the indegree and outdegree sequence are

$\underbrace{k, \dots, k}_{k \text{ times}}, \underbrace{n-1-k, \dots, n-1-k}_{n-2k \text{ times}}, \underbrace{n-1, \dots, n-1}_{k \text{ times}}.$

$G$ fails conditions (i) and (ii) in Conjecture 1 for $i = k$ and also one of the conditions in
Conjecture 2. In fact, a more complicated example is given in [18] where only one condition
in Conjecture 1 fails. So, if true, Conjecture 1 would be best possible in the same sense as
Chvátal's theorem.
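
The construction above can be checked by machine for small parameters. The following Python sketch builds the digraph, computes its degree sequences and tests Hamiltonicity by brute force. The function names and the labelling of the vertices ($I = \{0, \dots, k-1\}$, $X$ the first $k$ vertices of $K$) are our own illustrative choices, and the exhaustive search is only feasible for very small $n$.

```python
from itertools import permutations


def extremal_digraph(n, k):
    """Out-neighbourhoods of the extremal example: I = 0..k-1, K = k..n-1, X = k..2k-1."""
    if not 0 < 2 * k < n:
        raise ValueError("need 0 < k < n/2")
    out = {v: set() for v in range(n)}
    for u in range(k, n):              # K is complete
        for v in range(k, n):
            if u != v:
                out[u].add(v)
    for u in range(k):                 # all edges in both directions between I and X
        for x in range(k, 2 * k):
            out[u].add(x)
            out[x].add(u)
    return out


def degree_sequences(out):
    """Return the sorted outdegree and indegree sequences."""
    indeg = {v: 0 for v in out}
    for u in out:
        for v in out[u]:
            indeg[v] += 1
    return sorted(len(out[v]) for v in out), sorted(indeg.values())


def has_hamilton_cycle(out):
    """Brute-force search over all cyclic orderings (small digraphs only)."""
    vertices = sorted(out)
    first, rest = vertices[0], vertices[1:]
    for perm in permutations(rest):
        cycle = (first,) + perm
        if all(cycle[(j + 1) % len(cycle)] in out[cycle[j]] for j in range(len(cycle))):
            return True
    return False
```

With $n = 7$ and $k = 3$ both degree sequences come out as 3, 3, 3, 3, 6, 6, 6, in agreement with the displayed sequence, and the brute-force search finds no Hamilton cycle.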

A matching in a graph or digraph G is a set of pairwise disjoint edges. A cover is a set C of 
vertices such that every edge of G is incident to at least one vertex in C. For bipartite graphs 
these concepts are related by the following classical result of König.

Proposition 6. In any bipartite graph, a maximum matching and a minimum cover have equal 
size. 

The following result, known as the 'defect Hall theorem', may be easily deduced from
Proposition 6, using the observation that if $C$ is a cover then $N(A \setminus C) \subseteq C \cap B$.

Proposition 7. Suppose $G$ is a bipartite graph with vertex classes $A$ and $B$ and there is some
number $D$ such that for any $S \subseteq A$ we have $|N(S)| \ge |S| - D$. Then $G$ contains a matching of
size at least $|A| - D$.

We will also need the following well-known fact. 

Proposition 8. Suppose that $J$ is a digraph such that $|N^+(S)| \ge |S|$ for every $S \subseteq V(J)$. Then
$J$ has a 1-factor.

Proof. The result follows immediately by applying Proposition 7 (with $D = 0$) to the following
bipartite graph $\Gamma$: both vertex classes $A, B$ of $\Gamma$ are copies of the vertex set of the original
digraph $J$ and we connect a vertex $a \in A$ to $b \in B$ in $\Gamma$ if there is a directed edge from $a$ to $b$
in $J$. A perfect matching in $\Gamma$ corresponds to a 1-factor in $J$. □
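
The correspondence used in this proof is easy to make algorithmic. The following Python sketch finds a 1-factor of a digraph by computing a perfect matching in the auxiliary bipartite graph with a standard augmenting-path search. The function names one_factor and cycles_of are ours, and the sketch assumes that the digraph is given as a dictionary of out-neighbourhoods without loops; it is meant as an illustration rather than an efficient implementation.

```python
def one_factor(out):
    """Return a dict v -> successor describing a 1-factor of the digraph, or None.

    A perfect matching between a copy A and a copy B of the vertex set,
    using only pairs (a, b) with b in out[a], is found by augmenting paths.
    """
    match_of = {}                      # b -> a means the edge ab is in the matching

    def try_augment(a, seen):
        for b in out[a]:
            if b in seen:
                continue
            seen.add(b)
            if b not in match_of or try_augment(match_of[b], seen):
                match_of[b] = a
                return True
        return False

    for a in out:
        if not try_augment(a, set()):
            return None                # some set S has |N^+(S)| < |S|
    return {a: b for b, a in match_of.items()}


def cycles_of(succ):
    """Decompose a successor map (a permutation of the vertices) into its cycles."""
    seen, cycles = set(), []
    for start in succ:
        if start in seen:
            continue
        cycle, v = [], start
        while v not in seen:
            seen.add(v)
            cycle.append(v)
            v = succ[v]
        cycles.append(cycle)
    return cycles
```

If the search fails for some vertex, then by Hall's theorem the expansion condition of Proposition 8 is violated for some set $S$, and the function returns None.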

We conclude by recording the Chernoff bounds for binomial and hypergeometric distributions
(see e.g. Corollary 2.3 and Theorem 2.10). Recall that the binomial random variable $X$
with parameters $(n, p)$ is the sum of $n$ independent Bernoulli variables, each taking value 1
with probability $p$ or 0 with probability $1 - p$. The hypergeometric random variable $X$ with
parameters $(n, m, k)$ is defined as follows. We let $N$ be a set of size $n$, fix $S \subseteq N$ of size
$|S| = m$, pick a uniformly random $T \subseteq N$ of size $|T| = k$, then define $X = |T \cap S|$. Note that
$\mathbb{E}X = km/n$.

Proposition 9. Suppose $X$ has binomial or hypergeometric distribution and $0 < a < 3/2$. Then
$\mathbb{P}(|X - \mathbb{E}X| \ge a\mathbb{E}X) \le 2e^{-\frac{a^2}{3}\mathbb{E}X}$.
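
For small parameters the bound of Proposition 9 can be compared with the exact tail of the hypergeometric distribution. The following Python sketch does this using only the standard library; the function names are ours and the parameters follow the convention $(n, m, k)$ introduced above. It is a numerical illustration, not part of any proof.

```python
from math import comb, exp


def hypergeom_pmf(n, m, k, x):
    """P(X = x) when X = |T ∩ S| with |N| = n, |S| = m and |T| = k."""
    return comb(m, x) * comb(n - m, k - x) / comb(n, k)


def tail_probability(n, m, k, a):
    """Exact value of P(|X - EX| >= a * EX)."""
    mean = k * m / n
    lo, hi = max(0, k - (n - m)), min(k, m)      # support of X
    return sum(
        hypergeom_pmf(n, m, k, x)
        for x in range(lo, hi + 1)
        if abs(x - mean) >= a * mean
    )


def chernoff_bound(n, m, k, a):
    """Right-hand side of Proposition 9: 2 * exp(-(a^2 / 3) * EX)."""
    return 2 * exp(-(a * a / 3) * (k * m / n))
```

For instance, with $(n, m, k) = (200, 100, 50)$ and $a = 1/2$ one can compare tail_probability(200, 100, 50, 0.5) with chernoff_bound(200, 100, 50, 0.5); the exact tail is far below the bound, as one expects from a bound that is not tight for moderate deviations.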

4. Regularity 

The proof of Theorem 5 will use the directed version of Szemerédi's Regularity Lemma. In this
section, we state a digraph form of this lemma and establish some additional useful properties. 
For surveys on the Regularity Lemma and its applications we refer the reader to \15\ \TE[ [T6] . 

4.1. The Regularity Lemma. The density of a bipartite graph $G = (A, B)$ with vertex classes
$A$ and $B$ is defined to be $d_G(A, B) = \frac{e_G(A, B)}{|A||B|}$. We often write $d(A, B)$ if this is unambiguous.
Given $\varepsilon > 0$, we say that $G$ is $\varepsilon$-regular if for all subsets $X \subseteq A$ and $Y \subseteq B$ with $|X| \ge \varepsilon|A|$ and
$|Y| \ge \varepsilon|B|$ we have that $|d(X, Y) - d(A, B)| < \varepsilon$. Given $d \in [0, 1]$, we say that $G$ is $(\varepsilon, d)$-regular
if it is $\varepsilon$-regular of density at least $d$. We also say that $G$ is $(\varepsilon, d)$-super-regular if it is $\varepsilon$-regular
and furthermore $d_G(a) \ge d|B|$ for all $a \in A$ and $d_G(b) \ge d|A|$ for all $b \in B$.

Given a digraph $G$, and disjoint subsets $A, B$ of $V(G)$, we say that the ordered pair $(A, B)_G$
is $\varepsilon$-regular, if the corresponding undirected bipartite graph induced by the edges of $G$ from $A$
to $B$ is $\varepsilon$-regular. We use a similar convention for super-regularity. The Diregularity Lemma is
a version of the Regularity Lemma for digraphs due to Alon and Shapira [1]. We will use the
degree form of the Diregularity Lemma, which can be easily derived from the standard version,
in exactly the same manner as the undirected degree form. (See e.g. [16] for a sketch proof.)

Lemma 10 (Diregularity Lemma; Degree form). For every $\varepsilon \in (0, 1)$ and each positive integer
$M'$, there are positive integers $M$ and $n_0$ such that if $G$ is a digraph on $n \ge n_0$ vertices and
$d \in [0, 1]$ is any real number, then there is a partition of the vertices of $G$ into $V_0, V_1, \dots, V_k$
and a spanning subdigraph $G'$ of $G$ with the following properties:

• $M' \le k \le M$;

• $|V_0| \le \varepsilon n$, $|V_1| = \dots = |V_k| =: m$ and $G'[V_i]$ is empty for all $0 \le i \le k$;

• $d^+_{G'}(x) > d^+_G(x) - (d + \varepsilon)n$ and $d^-_{G'}(x) > d^-_G(x) - (d + \varepsilon)n$ for all $x \in V(G)$;

• all pairs $(V_i, V_j)_{G'}$ with $1 \le i, j \le k$ and $i \ne j$ are $\varepsilon$-regular with density either 0 or at
least $d$.

Note that we do not require the densities of $(V_i, V_j)_{G'}$ and $(V_j, V_i)_{G'}$ to be the same. We call
$V_1, \dots, V_k$ the clusters of the partition, $V_0$ the exceptional set and the vertices of $G$ in $V_0$ the
exceptional vertices. The reduced digraph $R = R_{G'}$ of $G$ with parameters $\varepsilon, d, M'$ (with respect
to the above partition) is the digraph whose vertices are the clusters $V_1, \dots, V_k$ and in which
$V_iV_j$ is an edge precisely when $(V_i, V_j)_{G'}$ has density at least $d$.

In various stages of our proof of Theorem 5, we will want to make some pairs of clusters
super-regular, while retaining the regularity of all other pairs. This can be achieved by the
following folklore lemma. Here and later on we write $0 < a_1 \ll a_2$ to mean that we can choose
the constants $a_2$ and $a_1$ from right to left. More precisely, there is an increasing function $f$ such
that, given $a_2$, whenever we choose some $a_1 \le f(a_2)$ all calculations in the proof of Lemma 11
are valid. Hierarchies with more constants are to be understood in a similar way.

Lemma 11. Let $0 < \varepsilon \ll d, 1/\Delta$ and let $R$ be a reduced digraph of $G$ as given by Lemma 10.
Let $H$ be a subdigraph of $R$ of maximum degree $\Delta$. Then, we can move exactly $\Delta\varepsilon m$ vertices
from each cluster $V_i$ into $V_0$ such that each pair of clusters corresponding to an edge of $H$
becomes $(2\varepsilon, \frac{d}{2})$-super-regular, while each pair of clusters corresponding to an edge of $R$ becomes
$(2\varepsilon, d - \varepsilon)$-regular.

Proof. For each cluster $V \in V(R)$, let

$A(V) := \{x \in V : |N^+_{G'}(x) \cap W| < (d - \varepsilon)m$ for some out-neighbour $W$ of $V$ in $H$,
or $|N^-_{G'}(x) \cap W| < (d - \varepsilon)m$ for some in-neighbour $W$ of $V$ in $H\}$.

The definition of regularity implies that $|A(V)| \le \Delta\varepsilon m$. Remove from each cluster $V$ a set of
size exactly $\Delta\varepsilon m$ containing $A(V)$. Since $\Delta\varepsilon \le \frac{1}{2}$, it follows easily that all pairs corresponding to
edges of $R$ become $(2\varepsilon, d - \varepsilon)$-regular. Moreover, the minimum degree of each pair corresponding
to an edge of $H$ is at least $(d - (\Delta + 1)\varepsilon)m \ge \frac{d}{2}m$, as required. □

Next we note the easy fact that regular pairs have nearly perfect matchings and super-regular 
pairs have perfect matchings. 

Lemma 12. Suppose $\varepsilon > 0$ and $G = (A, B)$ is an $(\varepsilon, 2\varepsilon)$-regular pair with $|A| = |B| = n$. Then
$G$ contains a matching of size at least $(1 - \varepsilon)n$. Furthermore, if $G$ is $(\varepsilon, 2\varepsilon)$-super-regular then
$G$ has a perfect matching.

Proof. For the first statement we verify the conditions of the defect Hall theorem (Proposition 7)
with $D = \varepsilon n$. We need to show that $|N(S)| \ge |S| - D$ for $S \subseteq A$. We can assume that
$|S| \ge D = \varepsilon n$. Then by $\varepsilon$-regularity, all but at most $\varepsilon n$ vertices in $B$ have at least $\varepsilon|S| > 0$
neighbours in $S$. Therefore $|N(S)| \ge (1 - \varepsilon)n \ge |S| - \varepsilon n$, as required. For the second statement
we need to show that $|N(S)| \ge |S|$ for $S \subseteq A$. For any $x \in S$ we have $d(x) \ge 2\varepsilon n$ by
super-regularity, so we can assume that $|S| \ge 2\varepsilon n$. Then as before we have $|N(S)| \ge (1 - \varepsilon)n$, so we
can assume that $|S| > (1 - \varepsilon)n$. But we also have $d(y) \ge 2\varepsilon n$ for any $y \in B$, so $N(y) \cap S \ne \emptyset$,
i.e. $N(S) = B$ and $|N(S)| = n \ge |S|$. □

We will also need the following regularity criterion for finding a Hamilton cycle in a
non-bipartite digraph. We say that a general digraph $G$ on $n$ vertices is $\varepsilon$-regular of density $d$ if
$\frac{e_G(X, Y)}{|X||Y|} = d \pm \varepsilon$ for all (not necessarily disjoint) subsets $X, Y$ of $V(G)$ of size at least $\varepsilon n$, and
$(\varepsilon, d)$-super-regular if it is $\varepsilon$-regular and $\delta^0(G) \ge dn$.

Lemma 13. Suppose $0 < \varepsilon \ll d \ll 1$, $n$ is sufficiently large and $G$ is an $(\varepsilon, d)$-super-regular
digraph on $n$ vertices. Then $G$ is Hamiltonian.

In fact, Frieze and Krivelevich [7, Theorem 4] proved that an $(\varepsilon, d)$-super-regular digraph has
$(d - 4\varepsilon^{1/2})n$ edge-disjoint Hamilton cycles, which is a substantial strengthening of Lemma 13.
Lemma 13 can also be deduced from Lemma 10 in [11].

Next we need a construction that we will use to preserve super-regularity of a pair when 
certain specified vertices are excluded. 

Lemma 14. Suppose $0 < \varepsilon \ll d \ll 1$, $G = (A, B)$ is an $(\varepsilon, d)$-super-regular pair with $|A| =
|B| = n$ sufficiently large and $X \subseteq A$ with $|X| \le n/3$. Then there is a set $Y \subseteq B$ with $|Y| = |X|$
such that $(A \setminus X, B \setminus Y)_G$ is $(2\varepsilon, d/2)$-super-regular.

Proof. If $|X| \le 2\varepsilon n$ then we choose $Y$ arbitrarily with $|Y| = |X|$. Next suppose that $|X| > 2\varepsilon n$.
We let $B_1$ be the set of vertices in $B$ that have less than $\frac{1}{2}d|A \setminus X|$ neighbours in $A \setminus X$.
Then $|B_1| \le \varepsilon n$ by $\varepsilon$-regularity of $G$. Consider choosing $B_2 \subseteq B \setminus B_1$ of size $|X| - |B_1|$
uniformly at random. For any $x$ in $A$ its degree in $B_2$ is $d_{B_2}(x) = |N_G(x) \cap B_2|$, which has
hypergeometric distribution with parameters $(|B \setminus B_1|, d_{B \setminus B_1}(x), |B_2|)$. Super-regularity gives
$\mathbb{E}[d_{B_2}(x)] = d_{B \setminus B_1}(x)|B_2|/|B \setminus B_1| > \varepsilon dn/2$, and the Chernoff bound (Proposition 9) applied
with $a = n^{2/3}/\mathbb{E}d_{B_2}(x) > n^{-1/3}$ gives
$\mathbb{P}(|d_{B_2}(x) - \mathbb{E}d_{B_2}(x)| > n^{2/3}) < 2e^{-an^{2/3}/3} < 2e^{-n^{1/3}/3}$. By
a union bound, there is some choice of $B_2$ so that every $x$ in $A$ has
$d_{B_2}(x) = d_{B \setminus B_1}(x)|B_2|/|B \setminus B_1| \pm n^{2/3} < 0.4\,d_B(x)$ (say). Let $Y = B_1 \cup B_2$.
Then $(A \setminus X, B \setminus Y)_G$ is $2\varepsilon$-regular, by $\varepsilon$-regularity
of $G$. Furthermore, every $y \in B \setminus Y$ has $d_{A \setminus X}(y) \ge \frac{1}{2}d|A \setminus X|$ by definition of $B_1$, and every $x$
in $A \setminus X \subseteq A$ has $d_{B \setminus Y}(x) \ge d_B(x) - |B_1| - d_{B_2}(x) \ge \frac{1}{2}d|B \setminus Y|$. □

Finally, given an $(\varepsilon, d)$-super-regular pair $G = (A, B)$, we will often need to isolate a small
subpair that maintains super-regularity in any subpair that contains it. For $A^* \subseteq A$ and $B^* \subseteq B$
we say that $(A^*, B^*)$ is an $(\varepsilon^*, d^*)$-ideal for $(A, B)$ if for any $A^* \subseteq A' \subseteq A$ and $B^* \subseteq B' \subseteq B$
the pair $(A', B')$ is $(\varepsilon^*, d^*)$-super-regular. The following lemma shows that ideals exist, and
moreover randomly chosen sets $A^*$ and $B^*$ form an ideal with high probability.

Lemma 15. Suppose $0 < \varepsilon \ll \theta, d < 1/2$, $n$ is sufficiently large and $G = (A, B)$ is
$(\varepsilon, d)$-super-regular with $n/2 \le |A|, |B| \le n$. Let $A^* \subseteq A$ and $B^* \subseteq B$ be independent uniformly random
subsets of size $\theta n$. Then with high probability $(A^*, B^*)$ is an $(\varepsilon/\theta, \theta d/4)$-ideal for $(A, B)$.

Proof. First we note that $\varepsilon$-regularity of $G$ implies that $(A', B')$ is $\varepsilon/\theta$-regular for any $A' \subseteq A$
and $B' \subseteq B$ with $|A'|, |B'| \ge \theta n$. For each $x \in A$, the degree of $x$ in $B^*$ is $d_{B^*}(x) =
|N_G(x) \cap B^*|$, which has hypergeometric distribution with parameters $(|B|, d_G(x), \theta n)$ and
expectation $\mathbb{E}[d_{B^*}(x)] \ge \theta d_G(x)$. By super-regularity we have $d_G(x) \ge dn/2$, so by the Chernoff
bound (Proposition 9) applied with $a = \theta dn/4\mathbb{E}[d_{B^*}(x)] \ge d/4$, we have
$\mathbb{P}(d_{B^*}(x) < \theta dn/4) < 2e^{-\theta d^2 n/48}$.
By a union bound, there is some choice of $B^*$ so that every $x \in A$ has at least $\theta dn/4$
neighbours in $B^*$, and so at least $(\theta d/4)|B'|$ neighbours in $B'$ for any $B^* \subseteq B' \subseteq B$. Arguing
similarly for $A^*$ gives the result. □



5. Overview of the proof 

We will first prove a special case of Theorem 5. Although it would be possible to give a
single argument that covers all cases, we believe it is instructive to understand the methods in
a simplified setting before introducing additional complications. This section gives an overview
of our techniques. We begin by defining additional constants such that

$\frac{1}{n_0} \ll \varepsilon \ll d \ll \gamma \ll d' \ll \eta \ll \eta' \ll \beta \ll 1.$

Note that this hierarchy of parameters will be used throughout the paper. By applying the
Diregularity Lemma to $G$ with parameters $\varepsilon, d$ and $M' = 1/\varepsilon$, we obtain a reduced digraph
$R_{G'}$ on $k$ clusters of size $m$ and an exceptional set $V_0$. We will see that the degree sequences of
$R_{G'}$ inherit many of the properties of the degree sequences of $G$. Then it will follow that $R_{G'}$
contains a union of cycles $F$ which covers all but at most $O(d^{1/2}k)$ of the clusters of $R_{G'}$. We
move the vertices of all clusters not covered by $F$ into $V_0$. By moving some further vertices into
$V_0$ we can assume that all edges of $F$ correspond to super-regular pairs.

Let $R^*_{G'}$ be the digraph obtained from $R_{G'}$ by adding the set $V_0$ of exceptional vertices and
for each $x \in V_0$ and each $V \in R_{G'}$ adding the edge $xV$ if $x$ has an outneighbour in $V$ and the
edge $Vx$ if $x$ has an inneighbour in $V$. We would like to find a closed walk $W$ in $R^*_{G'}$ such that

(a) For each cycle $C$ of $F$, $W$ visits every cluster of $C$ the same number of times, say $m_C$;

(b) We have $1 \le m_C \le m$, i.e. $W$ visits every cluster at least once but not too many times;

(c) $W$ visits every vertex of $V_0$ exactly once;

(d) For each $x_i \in V_0$ we can choose an inneighbour $x_i^-$ in the cluster preceding $x_i$ on $W$ and
an outneighbour $x_i^+$ in the cluster following $x_i$ on $W$, so that as $x_i$ ranges over $V_0$ all
vertices $x_i^+, x_i^-$ are distinct.

If we could find such a walk $W$ then by properties (a) and (b) we can arrange that $m_C = m$
for each cycle $C$ of $F$ by going round $C$ an extra $m - m_C$ times on one particular visit of $W$
to $C$. Then we could apply properties (c) and (d) to choose inneighbours and outneighbours
for every vertex of $V_0$ such that all choices are distinct. Finally, we could apply a powerful tool
known as the Blow-up Lemma (see [14]) to find a Hamilton cycle $C_{\mathrm{Ham}}$ in $G$ corresponding to
$W$, where $C_{\mathrm{Ham}}$ has the property that whenever $W$ visits a vertex of $V_0$, $C_{\mathrm{Ham}}$ visits the same
vertex, and whenever $W$ visits a cluster $V_i$ of $R_{G'}$, then $C_{\mathrm{Ham}}$ visits a vertex $x \in V_i$. (We will
not discuss the Blow-up Lemma further, as in fact we will take a different approach that does
not need it.)

To achieve property (a), we will build up $W$ from certain 'shifted' walks, each of them
satisfying property (a). Suppose $R$ is a digraph, $R'$ is a subdigraph of $R$, $F$ is a 1-factor in $R$
and $a, b$ are vertices. A shifted walk (with respect to $R'$ and $F$) from $a$ to $b$ is a walk $W(a, b)$ of
the form

$W(a, b) = X_1 C_1 X_1^- X_2 C_2 X_2^- \dots X_t C_t X_t^- X_{t+1},$

where $X_1 = a$, $X_{t+1} = b$, $C_i$ is the cycle of $F$ containing $X_i$, and for each $1 \le i \le t$, $X_i^-$ is the
predecessor of $X_i$ on $C_i$ and the edge $X_i^- X_{i+1}$ belongs to $R'$. We say that $W(a, b)$ traverses
the cycles $C_1, \dots, C_t$. Note that even if the cycles $C_1, \dots, C_t$ are not distinct we say that $W$
traverses $t$ cycles. Note also that, for every cycle $C$ of $F$, the walk $W(a, b) \setminus b$ visits the vertices
of $C$ an equal number of times.
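
The definition of a shifted walk can be illustrated by a short Python sketch that expands a list of entry vertices $X_1, \dots, X_{t+1}$ into the full vertex sequence of $W(a, b)$. The representation is our own assumption: the 1-factor $F$ is given as a successor map succ, the digraph $R'$ as a dictionary r_prime_out of out-neighbourhoods, and the function name shifted_walk is hypothetical.

```python
from collections import Counter


def shifted_walk(succ, r_prime_out, entries):
    """Expand entries = [X_1, ..., X_{t+1}] into the vertex sequence of W(a, b).

    For each i the walk goes once round the cycle of F through X_i, from X_i
    to its predecessor X_i^-, and then uses the edge X_i^- X_{i+1} of R'.
    """
    walk = []
    for x, nxt in zip(entries, entries[1:]):
        segment, v = [x], x
        while succ[v] != x:            # stop at the predecessor of x on its cycle
            v = succ[v]
            segment.append(v)
        if nxt not in r_prime_out[v]:
            raise ValueError(f"edge {v} -> {nxt} is not in R'")
        walk.extend(segment)
    walk.append(entries[-1])
    return walk


def visit_counts(walk):
    """Number of visits to each vertex in W(a, b) with the final vertex b removed."""
    return Counter(walk[:-1])
```

Since each segment goes exactly once round a cycle of $F$, the counts returned by visit_counts are constant on every cycle of $F$, which is the observation made at the end of the paragraph above.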

Given a shifted walk $W = W(a, b)$ as above we say that $W$ uses $X$ if $X$ appears in the list
$\{X_1^-, X_2, X_2^-, \dots, X_t, X_t^-, X_{t+1}\}$. More generally, we say that $X$ is used $s$ times by $W$ if it appears $s$
times in the above list (counting multiplicities). Thus $W$ uses $2t$ clusters, counting multiplicities.
We say that $W$ internally uses $X$ if $X \in \{X_2, X_2^-, \dots, X_t, X_t^-\}$ (i.e. we do not count the uses
of $X_1^-$ or $X_{t+1}$). We also refer to the uses of $X_2, \dots, X_{t+1}$ as entrance uses and $X_1^-, \dots, X_t^-$
as exit uses. If $X$ is used as both $X_i$ and $X_j$ for some $2 \le i < j \le t + 1$ then we can obtain a
shorter shifted walk from $a$ to $b$ by deleting the segment of $W$ between $X_i$ and $X_j$ (retaining
one of them). Similarly, we can obtain a shorter shifted walk if $X$ is used as both $X_i^-$ and $X_j^-$
for some $1 \le i < j \le t$. Thus we can always choose shifted walks so that any cluster is used at
most once as an entrance and at most once as an exit, and so is used at most twice in total.

We say that a cluster V is entered a times by W if W contains a edges whose final vertex is
V and which do not lie in F (where the edges of W are counted with multiplicities). We have 
a similar definition for exiting V a times. 

Next we define an auxiliary digraph $H$ that plays a crucial role in our argument. Let $R_{G''}$
be the spanning subdigraph of $R_{G'}$ obtained by deleting all those edges corresponding to a pair
of clusters whose density is less than $d'$. Let $F$ be the 1-factor of $R_{G'}$ mentioned above. The
vertices of $H$ are the clusters of $R_{G''}$. We have an edge from $a$ to $b$ in $H$ if there is a shifted
walk with respect to $R_{G''}$ and $F$ from $a$ to $b$ which traverses exactly one cycle. One can view
$H$ as a 'shifted version' of $R_{G''}$.
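
Unwinding the definition, a shifted walk from $a$ to $b$ traversing exactly one cycle has the form $a C a^- b$, so $ab$ is an edge of $H$ precisely when $a^- b$ is an edge of $R_{G''}$, where $a^-$ is the predecessor of $a$ on $F$. The following Python sketch computes $H$ in this way; as before, the successor-map representation of $F$, the dictionary r2_out of out-neighbourhoods of $R_{G''}$ and the function name shifted_digraph are our own illustrative assumptions.

```python
def shifted_digraph(succ, r2_out):
    """Out-neighbourhoods of H: a -> b in H iff pred(a) -> b in R_{G''}.

    succ maps each cluster to its successor on the 1-factor F, and
    r2_out[v] is the set of out-neighbours of the cluster v in R_{G''}.
    """
    pred = {w: v for v, w in succ.items()}     # predecessor map of F
    return {a: set(r2_out[pred[a]]) for a in succ}
```

In particular the outdegree of $a$ in $H$ equals the outdegree of $a^-$ in $R_{G''}$, which is one way to read the remark that $H$ is a 'shifted version' of $R_{G''}$.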

For now we will only consider the special case in which $H$ is highly connected. Even then,
the fact that the exceptional set $V_0$ can be much bigger than the cluster sizes creates a difficulty
in ensuring property (b), that $W$ does not visit a cluster too many times. A natural attempt to
overcome this difficulty is the technique from [5]. In that paper we split each cluster $V_i$ of $R_{G'}$
into two equal pieces $V_i^1$ and $V_i^2$. If the splitting is done at random, then with high probability,
the super-regularity between pairs of clusters corresponding to the edges of $F$ is preserved. We
then applied the Diregularity Lemma to the subdigraph of $G$ induced by $V_0 \cup V_1^2 \cup \dots \cup V_k^2$
with parameters $\varepsilon_2, d_2$ and $M_2' = 1/\varepsilon_2$ to obtain a reduced graph $R_2$ and an exceptional set
$V_0^2$. The advantage gained is that by choosing $\varepsilon_2 \ll d_2 \ll \varepsilon$ the exceptional set $V_0^2$ becomes
much smaller than the original cluster sizes and there is no difficulty with property (b) above.
However, the catch is that in our present case the degrees are capped at $n/2$, and in the course
of constructing the union of cycles in $R_2$ we would have to enlarge $V_0^2$ to such an extent that
this approach breaks down.

Our solution is to replace condition (b) by the following property for W: 

(b') $W$ visits every cluster of $R_{G'}$ at least once but does not use any cluster of $R_{G'}$ too many
times.



This condition can be guaranteed by the high connectivity property of H. However, we now 
have to deal with the fact that W may 'wind around' each cycle of F too many times. This will 
be addressed by a shortcutting technique, where for each cycle C in F we consider the required 
uses of C en masse and reassign routes so as not to overload any part of C. A side-effect of 
this procedure is that we may obtain a union of cycles, rather than a single Hamilton cycle. 
However, using a judicious choice of W and a switching procedure for matchings, we will be 
able to arrange that these shortcuts do produce a single Hamilton cycle. In particular, this 
approach does not rely on the Blow-up Lemma.

6. Structure I: Covering the reduced digraph by cycles

We start the proof by applying the Diregularity Lemma (Lemma 10) to $G$ with parameters
$\varepsilon, d$ and $M' = 1/\varepsilon$, obtaining a reduced digraph $R_{G'}$ on $k$ clusters of size $m$ and an exceptional
set $V_0$. Initially we have $|V_0| \le \varepsilon n$, although we will add vertices to $V_0$ during the argument.
Note also that $n = km + |V_0|$.

6.1. Properties of $R_{G'}$. Our main aim in this section is to show that $R_{G'}$ contains an almost
1-factor $F$, more specifically, a disjoint union of directed cycles covering all but at most $7d^{1/2}k$
vertices of $R_{G'}$. To begin with, we show that the degree sequences of $R_{G'}$ have similar properties
to the degree sequences of $G$.

Lemma 16.

(i) $d_i^+(R_{G'}) \ge \frac{1}{m}d^+_{im}(G) - 2dk$;

(ii) $d_i^-(R_{G'}) \ge \frac{1}{m}d^-_{im}(G) - 2dk$;

(iii) $\delta^+(R_{G'}) \ge \frac{\beta}{2}k$;

(iv) $\delta^-(R_{G'}) \ge \frac{\beta}{2}k$;

(v) $d_i^+(R_{G'}) \ge \min\{i + \frac{\beta}{2}k, (\frac{1}{2} - 2d)k\}$ or $d^-_{(1-\beta/2)k-i}(R_{G'}) \ge k - i - 2dk$;

(vi) $d_i^-(R_{G'}) \ge \min\{i + \frac{\beta}{2}k, (\frac{1}{2} - 2d)k\}$ or $d^+_{(1-\beta/2)k-i}(R_{G'}) \ge k - i - 2dk$.

Proof. We will only prove parts (i), (iii) and (v). Parts (ii), (iv) and (vi) can be obtained in
exactly the same way as parts (i), (iii) and (v) respectively, by interchanging $+$ and $-$ signs.
Consider $i$ clusters with outdegrees at most $d_i^+(R_{G'})$ in $R_{G'}$. These clusters contain $im$ vertices
of $G$, so must include a vertex $x$ of outdegree at least $d^+_{im}(G)$. Lemma 10 implies that the
cluster $V$ containing $x$ satisfies

$d^+_G(x) \le d^+_{G'}(x) + (d + \varepsilon)n \le d^+_{R_{G'}}(V)m + (d + \varepsilon)n + |V_0| \le d^+_{R_{G'}}(V)m + \frac{3}{2}dn.$

Therefore

$d_i^+(R_{G'}) \ge d^+_{R_{G'}}(V) \ge \frac{1}{m}d^+_{im}(G) - \frac{3}{2}\frac{dn}{m} \ge \frac{1}{m}d^+_{im}(G) - 2dk,$

which proves (i). Next, (iii) follows from (i), since $\delta^+(G) \ge \beta n$. To prove (v), suppose that
$d_i^+(R_{G'}) < \min\{i + \frac{\beta}{2}k, (\frac{1}{2} - 2d)k\}$. It follows from (i) that

$d^+_{im}(G) < \min\{m(i + \beta k/2 + 2dk), mk/2\} < \min\{im + \beta n, n/2\}.$

Using our degree assumptions gives $d^-_{n-im-\beta n}(G) \ge n - im$. Then by (ii) we have

$d^-_{(1-\beta/2)k-i}(R_{G'}) \ge \frac{1}{m}d^-_{(1-\beta/2)km-im}(G) - 2dk \ge \frac{1}{m}d^-_{n-im-\beta n}(G) - 2dk \ge k - i - 2dk,$

as required. □

Unfortunately, $R_{G'}$ need not satisfy the hypothesis of Proposition 9, so we cannot use it to
deduce the existence of a 1-factor in $R_{G'}$. The next lemma shows that a problem can only occur
for subsets $S$ of $V(R_{G'})$ of size close to $k/2$.

Lemma 17. Let $S$ be a subset of $V(R_{G'})$ such that either $|S| \le (1/2 - 2d)k$ or $|S| > (1/2 + 2d)k$.
Then $|N^+(S)|, |N^-(S)| \ge |S|$.

Proof. Suppose firstly that $|S| \le (1/2 - 2d)k$ but $|N^+(S)| < |S|$. By the minimum outdegree
condition of $R_{G'}$ (Lemma 16(iii)) we must have $|S| \ge \beta k/2$. Also
$d^+_{|S|-2dk-1}(R_{G'}) \le d^+_{|S|}(R_{G'}) < |S| \le (1/2 - 2d)k$, so Lemma 16(v) gives
$d^-_{(1-\beta/2)k-|S|+2dk+1}(R_{G'}) \ge k - |S| + 1$. Thus there
are at least $\beta k/2 + |S| - 2dk \ge |S|$ vertices of indegree at least $k - |S| + 1$. Now if $x$ has
indegree at least $k - |S| + 1$ then $N^-(x)$ intersects $S$, so $x$ belongs to $N^+(S)$. We deduce
that $|N^+(S)| \ge |S|$. A similar argument shows that $|N^-(S)| \ge |S|$ as well. Now suppose that
$|S| > (1/2 + 2d)k$ but $|N^+(S)| < |S|$, and consider $T = V(R_{G'}) \setminus N^+(S)$. Since $N^-(T) \cap S = \emptyset$,
we have $|N^-(T)| < |T|$, and so $|T| > (1/2 - 2d)k$ by the first case. But now we can consider a
subset $T'$ of $T$ of size $|T'| = (1/2 - 2d)k$ to see that $|N^-(T)| \ge |N^-(T')| \ge |T'| = (1/2 - 2d)k$, and
so $|S| \le (1/2 + 2d)k$, a contradiction. The claim for $|N^-(S)|$ follows by a similar argument. □

Applying Hall's theorem as in Proposition 9, one can use Lemma 17 to partition the vertex
set of $R_{G'}$ into a union of cycles and at most $4dk$ paths. However, for our approach we need to
find a disjoint union of cycles covering almost all the vertices. The first step towards this goal
will be to arrange that for each path its initial vertex has large indegree and its final vertex has
large outdegree. To prepare the ground, we show in the next lemma that if $R_{G'}$ does not have
a 1-factor, then it has many vertices of large outdegree and many vertices of large indegree.
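
The Hall-type criterion used here (a digraph has a 1-factor exactly when the bipartite graph joining each vertex to its outneighbours has a perfect matching) can be tested on small examples. The following is a minimal sketch and not part of the argument: the function name `has_one_factor` and the edge-list input format are illustrative assumptions, and the matching is found by a standard augmenting-path search.

```python
def has_one_factor(n, edges):
    """Return True if the digraph on vertices 0..n-1 has a 1-factor,
    i.e. a spanning union of vertex-disjoint directed cycles."""
    out = [[] for _ in range(n)]
    for u, v in edges:
        out[u].append(v)

    # match_in[v] = u means the edge u -> v is currently in the matching.
    match_in = [-1] * n

    def augment(u, seen):
        # Try to give u an outneighbour, re-routing earlier choices if needed.
        for v in out[u]:
            if v in seen:
                continue
            seen.add(v)
            if match_in[v] == -1 or augment(match_in[v], seen):
                match_in[v] = u
                return True
        return False

    # A 1-factor exists iff every vertex can be matched to a distinct outneighbour.
    return all(augment(u, set()) for u in range(n))
```

For instance, a directed triangle passes the test, while a directed path on three vertices fails it because its final vertex has no outneighbour.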

Lemma 18. If $R_{G'}$ does not have a 1-factor, then it contains more than $(1/2 + 2d)k$ vertices of
outdegree at least $(1/2 - 2d)k$ and more than $(1/2 + 2d)k$ vertices of indegree at least $(1/2 - 2d)k$.

Proof. Since $R_{G'}$ does not have a 1-factor, by Proposition 8 it contains a set $S$ with
$|N^+(S)| < |S|$. Then by Lemma 16(i) we have

$$|S| > |N^+(S)| \ge d^+_{|S|}(R_{G'}) \ge \frac{1}{m} d^+_{m|S|}(G) - 2dk,$$

and so

$$d^+_{m|S|}(G) < m(|S| + 2dk).$$

Moreover, $(1/2 - 2d)k < |S| \le (1/2 + 2d)k$ by Lemma 17. So if it were also the case that
$d^+_{m|S|-\beta n/2}(G) < n/2 = \min\{m|S| + \beta n/2, n/2\}$, then $d^-_{(1-\beta/2)n-m|S|}(G) \ge (1 + \beta/2)n - m|S|$ and
so by Lemma 16(ii) we would have

$$d^-_{(1-\beta/4)k-|S|}(R_{G'}) \ge \left(1 + \frac{\beta}{2}\right)k - |S| - 2dk \ge k - |S| + 1.$$

Then $R_{G'}$ contains at least $\beta k/4 + |S|$ vertices of indegree at least $k - |S| + 1$, and these must all
belong to $N^+(S)$, a contradiction. It follows that $d^+_{m|S|-\beta n/2}(G) \ge n/2$. So Lemma 16(i) gives

$$d^+_{|S|-\beta k/2}(R_{G'}) \ge \frac{1}{m} \cdot \frac{n}{2} - 2dk \ge \left(\frac{1}{2} - 2d\right)k,$$

i.e. $R_{G'}$ contains at least $(1 + \beta/2)k - |S| \ge (1/2 + 2d)k$ vertices of outdegree at least $(1/2 - 2d)k$,
which proves the first part of the lemma. The second part can be proved in exactly the same
way. □

Now we can show how to arrange the degree property for the paths. 

Lemma 19. The vertex set of $R_{G'}$ can be partitioned into a union of cycles and at most $4dk$
paths such that the initial vertices of the paths each have indegree at least $(1/2 - 2d)k$ and the
final vertices of the paths each have outdegree at least $(1/2 - 2d)k$.

Proof. We may assume that $R_{G'}$ does not have a 1-factor and so the consequences of Lemma 18
hold. We define an auxiliary digraph $R'_{G'}$ by adding $4dk$ new vertices $v_1, v_2, \dots, v_{4dk}$ to $R_{G'}$,
adding all possible edges between these vertices (in both directions), adding all edges of the form
$vv_i$, where $1 \le i \le 4dk$ and $v$ is a vertex of $R_{G'}$ of outdegree at least $(1/2 - 2d)k$, and finally
adding all edges of the form $v_iv$, where $1 \le i \le 4dk$ and $v$ is a vertex of $R_{G'}$ of indegree at least
$(1/2 - 2d)k$. Then any vertex that previously had indegree at least $(1/2 - 2d)k$ now has indegree
at least $(1/2 + 2d)k$, and similarly for outdegree. Also, Lemma 18 implies that every new vertex
$v_i$ has indegree and outdegree more than $(1/2 + 2d)k$. We claim that $R'_{G'}$ has a 1-factor. Having
proved this, the result will follow by removing $v_1, \dots, v_{4dk}$ from the cycles in the 1-factor. To
prove the claim, let us take $S \subseteq V(R'_{G'})$. By Proposition 8 we need to show that $|N^+(S)| \ge |S|$.
We consider cases according to the size of $S$. If $|S| \le (1/2 - 2d)k$, then either $S \subseteq V(R_{G'})$,
in which case $|N^+(S)| \ge |S|$ by Lemma 17, or $S$ contains some new vertex $v_i$, in which case
$|N^+(S)| \ge d^+(v_i) \ge (1/2 + 2d)k \ge |S|$. Next suppose that $(1/2 - 2d)k < |S| \le (1/2 + 2d)k$. As
before, if $S$ contains a new vertex $v_i$ we have $|N^+(S)| \ge d^+(v_i) \ge (1/2 + 2d)k \ge |S|$, so we can
assume $S \subseteq V(R_{G'})$. Now by Lemma 18 each new vertex $v_i$ has at least $(1/2 + 2d)k > k - |S|$
inneighbours in $V(R_{G'})$ and so $v_i$ has an inneighbour in $S$, i.e. $v_i \in N^+(S)$. Also, $S$ has at least
$(1/2 - 2d)k$ outneighbours in $R_{G'}$ by Lemma 17, so in $R'_{G'}$ we have $|N^+(S)| \ge 4dk + (1/2 - 2d)k \ge |S|$.
Finally suppose that $|S| > (1/2 + 2d)k$. Let $T = V(R'_{G'}) \setminus N^+(S)$. Considering a subset
$S' \subseteq S$ of size $(1/2 + 2d)k$ shows that $|T| \le k - |N^+(S')| \le k - |S'| = (1/2 - 2d)k$. However,
$N^-(T)$ is disjoint from $S$, so if $|N^+(S)| < |S|$ we have $|T| > |N^-(T)|$. Now similar arguments
to before give $|T| > (1/2 + 2d)k$, a contradiction. □

6.2. The almost 1-factor. We now come to the main result of this section. 

Lemma 20. $R_{G'}$ contains a disjoint union $F$ of cycles covering all but at most $7d^{1/2}k$ of its
vertices.

Proof. We implement the following algorithm. At each stage, the vertex set of $R_{G'}$ will be
partitioned into some cycles and paths and a waste set $W$. In every path the initial vertex will
have indegree at least $(1/2 - 2d)k$ and the final vertex will have outdegree at least $(1/2 - 2d)k$.
One of the paths will be designated as the 'active path'.

In the initial step, we begin with the partition guaranteed by Lemma 19. We have $W = \emptyset$
and choose an arbitrary path to be active.

In each iterative step we have some active path $P$. Let $u$ be the initial vertex of $P$ and $v$
its final vertex. Let $S$ be the sum of the numbers of vertices in all the paths. If at any point
$S \le 5d^{1/2}k$, then we move the vertices of all these paths into $W$ and stop. Otherwise we define
$\alpha = 5dk/S$ and for each path $P_r$ we let $\ell_r = \alpha|P_r|$. Note that the parameters $S$, $\alpha$ and $\{\ell_r\}$ are
recalculated at each step. By our assumption on $S$ we have $\alpha \le d^{1/2}$. Also $\sum_r \ell_r = \alpha S = 5dk$.
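
The recalculation of the parameters at each step can be sketched as follows. This is only an illustration under stated assumptions: the function name `step_parameters`, the list-of-lengths input and the use of floating-point numbers are ours, and a return value of `None` stands for the stopping rule in which all remaining paths are moved into the waste set.

```python
def step_parameters(path_lengths, d, k):
    """Recompute S, alpha and the allowances ell_r for the current paths.

    Returns None when S <= 5 * sqrt(d) * k (the algorithm stops and moves
    all remaining path vertices into the waste set W); otherwise returns
    (alpha, [ell_r for each path]).
    """
    S = sum(path_lengths)              # total number of vertices on paths
    if S <= 5 * d ** 0.5 * k:
        return None
    alpha = 5 * d * k / S              # so alpha <= sqrt(d) because S > 5*sqrt(d)*k
    ells = [alpha * length for length in path_lengths]
    return alpha, ells                 # sum(ells) == alpha * S == 5*d*k
```

With `d = 1e-4`, `k = 10000` and two paths of lengths 300 and 700, the sketch gives `alpha = 0.005`, which is at most `sqrt(d) = 0.01`, and the allowances sum to `5dk = 5`.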

For each cycle $C = w_1 \dots w_t w_1$ and $X \subseteq V(C)$ we write $X^+ = \{w_{i+1} : w_i \in X\}$ for the set
of successors of vertices of $X$. For each path $P_r = w_1 \dots w_t$, $X \subseteq V(P_r)$ and $1 \le s \le t$ we let
$X^{+s} = \{w_j : \exists\, w_i \in X,\ i < j \le i + s\}$. Also, for each path $P_r = w_1 \dots w_t$ which contains at least
one outneighbour of $v$ we let $i_r^v \ge 0$ be minimal such that $w_{i_r^v+1} \in N^+(v) \cap P_r$. Similarly, for
each path $P_r = w_1 \dots w_t$ which contains at least one inneighbour of $u$ we let $i_r^u \ge 0$ be minimal
such that $w_{t-i_r^u} \in N^-(u) \cap P_r$. We claim that at least one of the following conditions holds:

(1) There is a $w \in W$ such that $wu, vw \in E(R_{G'})$.

(2) There is a cycle $C = w_1 \dots w_i w_{i+1} \dots w_t w_1$ such that $w_iu, vw_{i+1} \in E(R_{G'})$.

(3) There is a path $P_r = w_1 \dots w_t$ and $1 \le i < j \le t$ with $j - i \le \ell_r + 1$ such that
$w_iu, vw_j \in E(R_{G'})$.

(4) There is a path $P_r = w_1 \dots w_t$ with $i_r^u \le \ell_r$ or $i_r^v \le \ell_r$.

To see this, suppose to the contrary that all these conditions fail. Since (1) fails, then
$(N^-(u) \cap W) \cap (N^+(v) \cap W) = \emptyset$ and so

• $|N^-(u) \cap W| + |N^+(v) \cap W| \le |W|$.

Since (2) fails, then for each cycle $C$ we have $(N^-(u) \cap C)^+ \cap (N^+(v) \cap C) = \emptyset$ and so

• $|N^-(u) \cap C| + |N^+(v) \cap C| \le |C|$ for each $C$.

Since (4) fails then for each path $P_r$ we have $|N^-(u) \cap P_r| \le |P_r| - \ell_r$ and $|N^+(v) \cap P_r| \le |P_r| - \ell_r$.
In particular

• for each path $P_r$, if $P_r$ does not meet both $N^-(u)$ and $N^+(v)$ then
$|N^-(u) \cap P_r| + |N^+(v) \cap P_r| \le |P_r| - \ell_r$.

On the other hand if a path $P_r$ meets both $N^-(u)$ and $N^+(v)$ then, since (3) fails we have
$(N^-(u) \cap P_r)^{+(\ell_r+1)} \cap (N^+(v) \cap P_r) = \emptyset$. Moreover, since $i_r^u > \ell_r$ and since also (4) fails, we
also have that $|(N^-(u) \cap P_r)^{+(\ell_r+1)}| \ge |N^-(u) \cap P_r| + \ell_r$. Altogether this gives that

• for each path $P_r$, if $P_r$ meets both $N^-(u)$ and $N^+(v)$ then
$|N^-(u) \cap P_r| + |N^+(v) \cap P_r| \le |P_r| - \ell_r$.

Summing these inequalities gives

$$d^-(u) + d^+(v) \le |W| + \sum_C |C| + \sum_r (|P_r| - \ell_r) = k - \sum_r \ell_r.$$

But we also have $\sum_r \ell_r = \alpha S = 5dk$ and $d^-(u), d^+(v) \ge (1/2 - 2d)k$ by the degree property of
the paths. This contradiction shows that at least one of the conditions (1)-(4) holds.
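
Spelled out, the contradiction comes from comparing the lower bound on $d^-(u) + d^+(v)$ given by the degree property of the paths with the upper bound obtained by summing. The following worked check uses only the two bounds stated above:

```latex
\[
  (1 - 4d)k \;=\; 2\left(\tfrac{1}{2} - 2d\right)k
  \;\le\; d^-(u) + d^+(v)
  \;\le\; k - \sum_r \ell_r
  \;=\; k - 5dk ,
\]
```

which would force $5dk \le 4dk$, impossible since $d, k > 0$.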
According to the above conditions we take one of the following actions. 

(1) Suppose there is a $w \in W$ such that $wu, vw \in E(R_{G'})$. Then we replace the path $P$ by
the cycle $C = wuPvw$, replace $W$ by $W \setminus \{w\}$, choose a new active path, and repeat.

(2) Suppose there is a cycle $C = w_1 \dots w_i w_{i+1} \dots w_t w_1$ such that $w_iu, vw_{i+1} \in E(R_{G'})$.
Then we replace the path $P$ and the cycle $C$ by the cycle $C' = w_1 \dots w_i u P v w_{i+1} \dots w_t w_1$,
choose a new active path, and repeat.

(3) Suppose there is a path $P_r = w_1 \dots w_t$ and $1 \le i < j \le t$ with $j - i \le \ell_r + 1$ such that
$w_iu, vw_j \in E(R_{G'})$.

(i) If $P_r \ne P$ then we replace the paths $P$ and $P_r$ with the path $P_r' = w_1 \dots w_i u P v w_j \dots w_t$,
replace $W$ with $W \cup \{w_{i+1}, \dots, w_{j-1}\}$, make $P_r'$ the new active path, and repeat.

(ii) If $P_r = P$ (so $w_1 = u$ and $w_t = v$) then we replace $P$ with the cycles $C_u =
uw_2 \dots w_{i-1}w_iu$ and $C_v = vw_j \dots w_{t-1}v$, replace $W$ with $W \cup \{w_{i+1}, \dots, w_{j-1}\}$, choose
a new active path, and repeat.

(4) Suppose there is a path $P_r = w_1 \dots w_t$ with $i_r^u \le \ell_r$ or $i_r^v \le \ell_r$.

(i) If $P_r \ne P$ and $i_r^u \le \ell_r$ then we replace the paths $P$ and $P_r$ with the path $P_r' =
w_1 \dots w_{t-i_r^u} u P v$, replace $W$ with $W \cup \{w_{t-i_r^u+1}, \dots, w_t\}$, make $P_r'$ the new active path,
and repeat.

(ii) If $P_r \ne P$ and $i_r^v \le \ell_r$ then we replace the paths $P$ and $P_r$ with the path
$P_r' = uPvw_{i_r^v+1} \dots w_t$, replace $W$ with $W \cup \{w_1, \dots, w_{i_r^v}\}$, make $P_r'$ the new active path,
and repeat.

(iii) If $P_r = P$ (so $w_1 = u$ and $w_t = v$) and $i_r^u \le \ell_r$ then we replace $P$ with the cycle
$C = uPw_{t-i_r^u}u$, replace $W$ with $W \cup \{w_{t-i_r^u+1}, \dots, w_t\}$, choose a new active path, and
repeat.

(iv) If $P_r = P$ and $i_r^v \le \ell_r$ then we replace $P$ with the cycle $C = vw_{i_r^v+1}Pv$, replace
$W$ with $W \cup \{w_1, \dots, w_{i_r^v}\}$, choose a new active path, and repeat.

At each step the number of paths is reduced by at least 1, so the algorithm will terminate.
It remains to show that $|W| \le 7d^{1/2}k$. Recall that at every step we have $\ell_r = \alpha|P_r| \le d^{1/2}|P_r|$
for each path $P_r$. For every vertex $w$ added to $W$ we charge its contribution to the path that $w$
initially belonged to. To calculate the total contribution we break it down by the above cases
and by initial paths. Cases (1) and (2) do not increase the size of $W$. In Case 3(i), every initial
path $P_r$ is merged with an active path $P$ at most once, and then its remaining vertices stay in
the active path until a new active path is chosen, so this gives a contribution to $W$ of at most
$\ell_r \le d^{1/2}|P_r|$ from $P_r$. In Case 3(ii), the vertices of the active path $P_r = P$ are contained in a
union $\bigcup_{i \in I} V(P_i)$ of some subset of the initial paths (excluding some vertices already moved into
$W$). These paths collectively contribute at most $\alpha|P| \le d^{1/2} \sum_{i \in I} |P_i|$, and each initial path
is merged at most once into such a path $P$. In Cases 4(i) and 4(ii), as in Case 3(i), an initial
path $P_r$ contributes at most $\alpha|P_r|$. In Cases 4(iii) and 4(iv), as in Case 3(ii), the vertices of the
active path $P_r = P$ are contained in a union $\bigcup_{i \in I} V(P_i)$ of some subset of the initial paths and
contribute at most $\alpha|P| \le d^{1/2} \sum_{i \in I} |P_i|$. So each initial path contributes to $W$ at most twice:
once when it is merged into the active path (in Cases 3(i), 4(i) or 4(ii)) and once when this
active path is turned into one or two cycles (in Cases 3(ii), 4(iii) or 4(iv)). Therefore we get a
total contribution from the paths of at most $2d^{1/2}k$ to $W$. Finally, there is another contribution
of at most $5d^{1/2}k$ if at any step we have $S \le 5d^{1/2}k$. In total we have $|W| \le 7d^{1/2}k$. □

6.3. Further properties of $F$. Now we have an almost 1-factor $F$ in $R_{G'}$, i.e. a disjoint union
of cycles covering all but at most $7d^{1/2}k$ clusters of $R_{G'}$. We move all vertices of these uncovered
clusters into $V_0$, which now has size at most $8d^{1/2}n$. During the proof of Theorem 5 it will be
helpful to arrange that each cycle of $F$ has length at least 4 (say) and moreover, all pairs of
clusters corresponding to edges of $F$ correspond to super-regular pairs. (This assumption on
the lengths is not actually necessary but does make some of the arguments in the final section
more transparent.)

We will now show that we may assume this. Indeed, if $F$ contains cycles of lengths less than
4, we arbitrarily partition each cluster of $R_{G'}$ into 2 parts of equal size. (If the sizes of the
clusters are not divisible by 2, then before the partitioning we move at most 1 vertex from each
cluster into $V_0$ in order to achieve this.) Consider the digraph $R'_{G'}$ whose vertices correspond to
the parts and where two vertices are joined by an edge if the corresponding bipartite subdigraph
of $G'$ is $(2\varepsilon, \frac{2}{3}d)$-regular. It is easy to check that this digraph contains the 2-fold 'blowup' of
$R_{G'}$, i.e. each original vertex is replaced by an independent set of 2 new vertices and there is
an edge from a new vertex $x$ to a new vertex $y$ if there was such an edge between the original
vertices. Each cycle of length $\ell$ in $F$ induces a 2-fold blowup of $C_\ell$ in $R'_{G'}$, which contains a
cycle of length $2\ell \ge 4$. So $R'_{G'}$ contains a 1-factor $F'$ all of whose cycles have length at least 4.
Note that the size of $V_0$ is now at most $9d^{1/2}n$.
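
The claim that the 2-fold blowup of a directed cycle $C_\ell$ contains a directed cycle of length $2\ell$ can be checked directly. The sketch below is illustrative only: the vertex labelling `(i, a)` (position `i` on the cycle, copy `a` in `{0, 1}`) and the function names are assumptions made for the example. It goes once around the cycle on copy 0 and once on copy 1.

```python
def blowup_cycle(l):
    """Build the 2-fold blowup of the directed cycle C_l and a cycle of length 2l in it."""
    # Every copy of position i sends an edge to every copy of position i+1 (mod l).
    edges = {((i, a), ((i + 1) % l, b))
             for i in range(l) for a in (0, 1) for b in (0, 1)}
    # Traverse copy 0 of each position, then copy 1 of each position.
    cycle = [(i, 0) for i in range(l)] + [(i, 1) for i in range(l)]
    return edges, cycle


def is_cycle(edges, cycle):
    """Check that `cycle` lists distinct vertices and consecutive ones are joined by edges."""
    n = len(cycle)
    distinct = len(set(cycle)) == n
    return distinct and all((cycle[j], cycle[(j + 1) % n]) in edges for j in range(n))
```

For `l = 2` or `l = 3` (the short cycles that need to be lengthened) this yields cycles of length 4 and 6 respectively.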

Secondly, we apply Lemma 11 to make the pairs of clusters corresponding to edges of $F'$
$(4\varepsilon, \frac{d}{2})$-super-regular by moving exactly $4\varepsilon|V_i|$ vertices from each cluster $V_i$ into $V_0$ and thus
increasing the size of $V_0$ to at most $10d^{1/2}n$. For convenience, having made these alterations,
we will still denote the reduced digraph by $R_{G'}$, the order of $R_{G'}$ by $k$, its vertices (the clusters)
by $V_1, \dots, V_k$ and their sizes by $m$. We also rename $F'$ as $F$. We sometimes refer to the cycles
in $F$ as $F$-cycles.

6.4. A modified reduced digraph. Let $R_{G''}$ be the spanning subdigraph obtained from $R_{G'}$
by deleting all those edges which correspond to pairs of density at most $d'$. Recalling that
$d \ll d'$, we note that the density of pairs corresponding to edges in $R_{G''}$ is much larger than
the proportion $10\sqrt{d}$ of vertices lying in $V_0$. The purpose of $R_{G'}$ was to construct $F$ so that
this property would hold. Now we have no further use for $R_{G'}$ and will work only with $R_{G''}$.
(Actually, we could use either $R_{G''}$ or $R_{G'}$ for the special case in the next section, but we need
to work with $R_{G''}$ in general.)

Let $G''$ be the digraph obtained from $G'$ by deleting all edges belonging to pairs
$(X, Y)$ of clusters so that $(X, Y)_{G'}$ has density at most $d'$. We say that a vertex $x \in X$ is typical
if

• $d^{\pm}_{G''}(x) \ge d^{\pm}_G(x) - 4d'n$;

• there are at most $\sqrt{\varepsilon}k$ clusters $Y$ such that $x$ does not have $(1 \pm 1/2)d_{XY}m$ outneighbours
in $Y$, where $d_{XY}$ denotes the density of the pair $(X, Y)_{G''}$. The analogous statement
also holds for the inneighbourhood of $x$.

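Lemma 21. By moving exactly $16\sqrt{\varepsilon}m$ vertices from each cluster into $V_0$, we can arrange that
each vertex in each cluster of $R_{G''}$ is typical. We still denote the cluster sizes by $m$. Then we
have

(i) $d_i^+(R_{G''}) \ge \frac{1}{m} d_{im}^+(G) - 5d'k$;

(ii) $d_i^-(R_{G''}) \ge \frac{1}{m} d_{im}^-(G) - 5d'k$;

(iii) $\delta^+(R_{G''}) \ge \frac{\beta}{2} k$;

(iv) $\delta^-(R_{G''}) \ge \frac{\beta}{2} k$;

(v) $d_i^+(R_{G''}) \ge \min\{i + \frac{\beta}{2} k, (\frac{1}{2} - 5d') k\}$ or $d^-_{(1-\beta/2)k-i}(R_{G''}) \ge k - i - 5d'k$;

(vi) $d_i^-(R_{G''}) \ge \min\{i + \frac{\beta}{2} k, (\frac{1}{2} - 5d') k\}$ or $d^+_{(1-\beta/2)k-i}(R_{G''}) \ge k - i - 5d'k$.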

Proof. Suppose that we are given clusters $X, Y$ such that $XY$ is an edge of $R_{G'}$. Write $d_{XY}$ for
the density of $(X, Y)_{G'}$. We say that $x \in X$ is out-typical for $Y$ if (in $G'$) $x$ has $(1 \pm 1/3)d_{XY}m$
outneighbours in $Y$. Since the pair $(X, Y)_{G'}$ is $4\varepsilon$-regular, it follows that at most $8\varepsilon m$ vertices
of $X$ are not out-typical for $Y$. Then on average, a vertex of $X$ is not out-typical for at most $8\varepsilon k$
clusters. It follows that there are at most $8\sqrt{\varepsilon}m$ vertices $x$ in $X$ for which there are more than
$\sqrt{\varepsilon}k$ clusters $Y$ such that $x$ is not out-typical for $Y$. Therefore we can remove a set of exactly
$8\sqrt{\varepsilon}m$ vertices from each cluster so that all of the remaining vertices are out-typical for at least
$(1 - \sqrt{\varepsilon})k$ clusters. We proceed similarly for the inneighbourhood of each cluster. Altogether,
we have removed exactly $16\sqrt{\varepsilon}m$ vertices from each cluster. These vertices are added to $V_0$,
which now has size $|V_0| \le 11\sqrt{d}n$. Now consider some cluster $X$ and a vertex $x \in X$. Since $x$ is
out-typical for all but at most $\sqrt{\varepsilon}k$ clusters, it sends at most $\sqrt{\varepsilon}k \cdot m + k \cdot 2d'm \le 3d'n$ edges
into clusters $Y$ such that $(X, Y)_{G'}$ has density at most $d'$. Then the following estimate shows
that $x$ is typical:

$$d^+_{G''}(x) \ge d^+_{G'}(x) - 3d'n - |V_0| \ge d^+_G(x) - (d + \varepsilon)n - 3d'n - 11\sqrt{d}n \ge d^+_G(x) - 4d'n.$$

For (i)-(vi), we proceed similarly as in the proof of Lemma 16. For (i), consider $i$ clusters with
outdegrees at most $d_i^+(R_{G''})$ in $R_{G''}$. These clusters contain $im$ vertices of $G$, so must include
a typical vertex $x$ of outdegree at least $d_{im}^+(G)$. As in the previous estimate, the cluster $V$
containing $x$ satisfies

$$d_{im}^+(G) \le d_G^+(x) \le d^+_{R_{G''}}(V)m + 4d'n + |V_0| \le d^+_{R_{G''}}(V)m + \frac{9}{2}d'n.$$

Therefore

$$d_i^+(R_{G''}) \ge d^+_{R_{G''}}(V) \ge \frac{1}{m} d_{im}^+(G) - \frac{9}{2} d' \frac{n}{m} \ge \frac{1}{m} d_{im}^+(G) - 5d'k,$$

which proves (i). Next, (iii) follows from (i), since $\delta^+(G) \ge \beta n$. The proof of (v) is the same as
that of (v) in Lemma 16, with $2d$ replaced by $5d'$. The proofs of the other three assertions are
similar. □

By removing at most one extra vertex from each cluster we may assume that the size of each
cluster is even. We continue to denote the sizes of the modified clusters by $m$ and the set of
exceptional vertices by $V_0$. The large-scale structure of our decomposition will not undergo any
significant further changes: there will be no further changes to the cluster sizes, although in
some subsequent cases we may add a small number of clusters to $V_0$ in their entirety. For future
reference we note the following properties:

• $|V_0| \le 11\sqrt{d}n$,

• all edges of $R_{G''}$ correspond to $(10\varepsilon, d'/2)$-regular pairs (the deletion of atypical vertices
may have reduced the densities slightly),

• all edges of $F$ correspond to $(10\varepsilon, d/4)$-super-regular pairs.

7. The highly connected case 

In this section we illustrate our methods by proving Theorem 5 in the case when the auxiliary
graph $H$ is strongly $\eta k$-connected. We recall that $d' \ll \eta \ll \beta$, and that $H$ was defined in
Section 5 as a 'shifted version' of $R_{G''}$, i.e. there is an edge in $H$ from a cluster $V_i$ to a cluster
$V_j$ if there is a shifted walk (with respect to $R_{G''}$ and $F$) from $V_i$ to $V_j$ which traverses exactly
one cycle. We refer to that section for the definitions of when a cluster is 'used' or 'internally
used' by a shifted walk, and recall that we can assume that any cluster is used at most once as
an entrance and at most once as an exit.

Lemma 22. Suppose $H$ is strongly $ck$-connected for some $c > 0$ and $a, b$ are vertices of $H$ (i.e.
clusters). Then there is a collection of at least $c^2k/16$ shifted walks (with respect to $R_{G''}$ and
$F$) from $a$ to $b$ such that each walk traverses at most $2/c$ cycles and each cluster is internally
used by at most one of the walks.

Proof. Since $H$ is strongly $ck$-connected we can find $ck$ internally disjoint paths $P_1, \dots, P_{ck}$
from $a$ to $b$. There cannot be $ck/2$ of these paths each having at least $2/c$ internal vertices, as
$H$ has $k$ vertices. Therefore $H$ contains at least $\ell := ck/2$ internally disjoint paths $P_1, \dots, P_\ell$
(say, after relabelling) from $a$ to $b$ which have length at most $2/c$. Note that each of these
corresponds to a shifted walk from $a$ to $b$ which traverses at most $2/c$ cycles. Let $W_1, \dots, W_\ell$
denote these shifted walks. Since the $P_i$ are internally disjoint, each cluster $x$ is internally used
by at most 2 of the shifted walks $W_j$ (either as an entrance or as an exit). Each shifted walk $W_i$
internally uses at most $4/c$ clusters, so there are at most $8/c - 1$ other shifted walks $W_j$ which
internally use a cluster that $W_i$ also uses internally. Thus we can greedily choose a subset of
the walks $W_1, \dots, W_\ell$ having the required properties. □
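
The greedy choice at the end of this proof can be sketched as follows. This is an illustration under assumptions rather than the construction itself: each walk is represented only by the set of clusters it uses internally, and the function name `greedy_disjoint` is hypothetical. A walk is kept whenever it shares no internally used cluster with the walks kept so far, so each kept walk rules out only the walks that conflict with it.

```python
def greedy_disjoint(walk_clusters):
    """Greedily pick walks so that no cluster is internally used by two picked walks.

    walk_clusters[i] is the set of clusters internally used by walk i.
    Returns the indices of the picked walks, in input order.
    """
    chosen = []
    used = set()                      # clusters internally used by picked walks
    for idx, clusters in enumerate(walk_clusters):
        if used.isdisjoint(clusters):
            chosen.append(idx)
            used |= set(clusters)
    return chosen
```

For example, `greedy_disjoint([{1, 2}, {2, 3}, {4}, {3, 5}])` keeps walks 0, 2 and 3, since walk 1 shares cluster 2 with walk 0.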

Given any cluster $X$, recall that we write $X^+$ for the successor of $X$ on $F$ and $X^-$ for its
predecessor. For every $X$, we apply Lemma 15 with $\theta = 16d$ to the $(10\varepsilon, d/4)$-super-regular pair
$(X, X^+)_{G'}$ to obtain a $(\sqrt{\varepsilon}, d^2)$-ideal $(X_1, X_2^+)$. Set $X^* := X_1 \cup X_2$ (where $(X_1^-, X_2)$ is the ideal
chosen for $(X^-, X)$). Then, by Lemma 15, we have $|X^*| \le 32dm$ and for any $X^* \subseteq X' \subseteq X$
and $(X^+)^* \subseteq (X^+)' \subseteq X^+$ the pair $(X', (X^+)')$ is $(\sqrt{\varepsilon}, d^2)$-super-regular.

First we construct the walk $W$ described in the overview. List the elements of the exceptional
set as $V_0 = \{x_1, \dots, x_r\}$. We go through the list sequentially, and for each $x_i$ we pick clusters
$X_i$ and $Y_i$ of $R_{G''}$ and vertices $x_i^- \in N^-(x_i) \cap (X_i \setminus X_i^*)$ and $x_i^+ \in N^+(x_i) \cap (Y_i \setminus Y_i^*)$ such
that $x_1^-, x_1^+, \dots, x_r^-, x_r^+$ are distinct and moreover no cluster of $R_{G''}$ appears more than $m/60$
times as a cluster of the form $X_j, Y_j$ (and thus no cluster appears more than $m/20$ times as a
cluster of the form $X_i, X_i^+, Y_i, Y_i^-$). To see that this is possible, recall that $|V_0| \le 11d^{1/2}n$ and
$|X^*| \le 32dm$ for all $X$. At most $3|V_0|$ vertices belong to $V_0$ or to the set $\{x_1^-, x_1^+, \dots, x_{i-1}^-, x_{i-1}^+\}$,
and at most $120|V_0|$ belong to clusters that appear at least $m/60$ times as $X_j$ or $Y_j$ for $x_j$ with
$j < i$. Therefore at most $1500d^{1/2}n < \beta n \le \delta^-(G)$ vertices are unavailable at stage $i$, so we
can choose $x_i^-$ and $X_i$ as required. A similar argument applies for $x_i^+$ and $Y_i$. Note that by
construction each cluster contains at most $m/20$ of the vertices $x_i^{\pm}$.
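
One possible way to account for the bound $1500d^{1/2}n$ on the number of unavailable vertices is sketched below. The split into three terms is an assumption about how the count is organised; in particular, the term $32dn$ for the union of the sets $X^*$ (at most $32dm$ vertices in each of at most $n/m$ clusters) is our reading and is not stated explicitly in the text.

```latex
\[
  \underbrace{3|V_0|}_{V_0 \text{ and chosen } x_j^{\pm}}
  \;+\; \underbrace{120|V_0|}_{\text{overused clusters}}
  \;+\; \underbrace{32dn}_{\bigcup_X X^*}
  \;\le\; 123 \cdot 11\, d^{1/2} n + 32\, d^{1/2} n
  \;=\; 1385\, d^{1/2} n
  \;\le\; 1500\, d^{1/2} n ,
\]
```

using $|V_0| \le 11d^{1/2}n$ and $d \le d^{1/2}$.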

Next we sequentially define shifted walks $W(Y_i, X_{i+1}^+)$ with respect to $R_{G''}$ and $F$ from $Y_i$
to $X_{i+1}^+$ for $1 \le i \le r - 1$. We want each $W(Y_i, X_{i+1}^+)$ to traverse at most $2/\eta$ cycles and each
cluster to be internally used at most $m/30$ times by the collection of all the walks $W(Y_i, X_{i+1}^+)$.
To see that this is possible, suppose we are about to find $W(Y_i, X_{i+1}^+)$ and let $A$ be the set
of clusters internally used at least $m/40$ times by the walks $W(Y_j, X_{j+1}^+)$ with $j < i$. Since
each of our walks internally uses at most $4/\eta$ clusters (although it visits many more) we have
$|A| \le \frac{(4/\eta)|V_0|}{m/40} \le \eta^2k/16$ (since $d \ll \eta$). Now Lemma 22 implies that we can find a shifted walk
$W(Y_i, X_{i+1}^+)$ from $Y_i$ to $X_{i+1}^+$ that traverses at most $2/\eta$ cycles and does not internally use any
cluster in $A$. We may assume that $W(Y_i, X_{i+1}^+)$ uses each cluster at most once as an entrance
and at most once as an exit, and then no cluster is internally used more than $2 + m/40 \le m/30$
times by the collection of all the walks $W(Y_j, X_{j+1}^+)$ for all $j \le i$, as required.

We conclude this step by choosing a shifted walk $W(Y_r, X_1^+)$ from $Y_r$ to $X_1^+$. Since there
may be clusters in $R_{G''}$ that we have not yet used, we construct this walk as a sequence of at
most $k$ shifted walks each traversing at most $2/\eta$ cycles, in such a way that every cluster is used
at least once by $W(Y_r, X_1^+)$.

This leads us to define a closed walk $W$ with vertex set $V_0 \cup V(R_{G''})$ as follows. Let
$W(Y_i, X_{i+1})$ be the walk from $Y_i$ to $X_{i+1}$ which is obtained from $W(Y_i, X_{i+1}^+)$ by adding the
path from $X_{i+1}^+$ to $X_{i+1}$ in $F$. We now define

$$W = x_1 W(Y_1, X_2) x_2 \dots x_r W(Y_r, X_1) x_1.$$

Using the choice of the clusters $X_i$ and $Y_i$ it is easy to see that $W$ uses every cluster of $R_{G''}$
at most $m/20 + m/30 + k \cdot 8/\eta \le m/10$ times. Thus $W$ has the properties mentioned in the
overview, namely:

(a) For each cycle $C$ of $F$, $W$ visits every vertex of $C$ the same number of times;

(b') $W$ visits every cluster of $R_{G''}$ at least once and uses every cluster of $R_{G''}$ at most $m/10$
times;

(c) $W$ visits every vertex of $V_0$ exactly once;

(d) For each $x_i \in V_0$ we have chosen an inneighbour $x_i^-$ in the cluster $X_i$ preceding $x_i$ on $W$
and an outneighbour $x_i^+$ in the cluster $Y_i$ following $x_i$ on $W$, so that as $x_i$ ranges over
$V_0$ all the vertices $x_i^+, x_i^-$ are distinct.

Now we fix edges in $G$ corresponding to all edges of $W$ that do not lie within a cycle of $F$. We
have already fixed the edges incident to vertices of $V_0$ (properties (c) and (d)). Then we note
that the remaining edges of $W$ not in $E(F)$ are precisely those of the form $AB$ where $A$ is used
as an exit by $W$ and $B$ is used as an entrance by $W$. To see this, note that we cannot have
$A = B^-$, as then $B$ would be used twice as an entrance in one of the shifted walks constructed
above, which is contrary to our assumption. Next we proceed through the clusters $V_1, \dots, V_k$
sequentially choosing edges as follows. When we come to $V_i$, we consider each $j < i$ in turn. If
$V_iV_j \notin E(F)$ we let $w_{ij}$ be the number of times that $W$ uses $V_iV_j$. Similarly, if $V_jV_i \notin E(F)$
we let $w_{ji}$ be the number of times that $W$ uses $V_jV_i$. We aim to choose a matching in $G$ that
avoids all previously chosen vertices and uses $w_{ij}$ edges from $V_i \setminus V_i^*$ to $V_j \setminus V_j^*$ and $w_{ji}$ edges
from $V_j \setminus V_j^*$ to $V_i \setminus V_i^*$. This can be achieved greedily as follows. Suppose for example that
$w_{ij} > 0$ and that when we come to $V_j$ the available vertices are $V_i' \subseteq V_i$ and $V_j' \subseteq V_j$. Since
every cluster is used at most $m/10$ times we have $w_{ij} \le m/10$, and we have $|V_i'|, |V_j'| \ge m/2$
(say, taking account of at most $m/10$ uses, $m/20$ vertices $x_i^{\pm}$ and $32dm$ vertices in $V_i^*$ or $V_j^*$).
Then $(V_i', V_j')_{G''}$ induces a $(20\varepsilon, d'/3)$-regular pair, so by Lemma 12 has a matching of size at
least $(1 - 20\varepsilon)m/2 > w_{ij}$. The same argument can be used if we also have $w_{ji} > 0$. After
considering all such pairs $(i, j)$ we have found edges in $G$ corresponding to all edges of $W$ that
do not lie within a cycle of $F$.

Now let Entry denote the set of all those vertices which do not lie in the exceptional set and
which are the final vertex of an edge of $G$ that we have fixed (i.e. the edges incident to the
vertices in $V_0$ and the edges chosen in the previous paragraph). Similarly, let Exit denote the
set of all those vertices which do not lie in the exceptional set and which are the initial vertex
of an edge of $G$ that we have fixed. Note that Entry $\cap$ Exit $= \emptyset$.

For every cluster $U$, let $U_{Exit} := U \cap$ Exit and $U_{Entry} := U \cap$ Entry. Since $W$ was built up
by shifted walks, it follows that $|U_{Exit}| = |U^+_{Entry}|$. Moreover, since we chose Entry and Exit
to avoid $U^*$, we know that $(U \setminus U_{Exit}, U^+ \setminus U^+_{Entry})_{G'}$ is $(\sqrt{\varepsilon}, d^2)$-super-regular, so contains a
perfect matching by Lemma 12. Now the edges of these perfect matchings together with the
edges of $W$ that we fixed in the previous step form a 1-factor $\mathcal{C}$ of $G$. It remains to modify $\mathcal{C}$
into a Hamilton cycle of $G$.

The following statement provides us with the tool we need. For any cluster $U$, let $G_U :=
(U^- \setminus U^-_{Exit}, U \setminus U_{Entry})_{G'}$ and let $Old_U$ be the perfect matching in $G_U$ which is contained in $\mathcal{C}$.

(†) For any cluster $U$, we can find a perfect matching $New_U$ in $G_U$ so that if we replace
$Old_U$ in $\mathcal{C}$ with $New_U$, then all vertices of $G_U$ will lie on a common cycle in the new
1-factor $\mathcal{C}$. In particular, all vertices in $U \setminus U_{Entry}$ will lie on a common cycle $C_U$ in $\mathcal{C}$
and moreover any pair of vertices of $G$ that were formerly on a common cycle are still
on a common cycle after we replace $Old_U$ by $New_U$.

To prove this statement we proceed as follows. For every $u \in U \setminus U_{Entry}$, we move along the
cycle $C_u$ of $\mathcal{C}$ containing $u$ (starting at $u$) and let $f(u)$ be the first vertex on $C_u$ in $U^- \setminus U^-_{Exit}$.
Define an auxiliary digraph $J$ on $U \setminus U_{Entry}$ such that $N_J^+(u) := N^+_{G_U}(f(u))$. So $J$ is obtained
by identifying each pair $(u, f(u))$ into one vertex with an edge from $(u, f(u))$ to $(v, f(v))$ if $G_U$
has an edge from $f(u)$ to $v$. Now $G_U$ is $(\sqrt{\varepsilon}, d^2)$-super-regular by the definition of the sets $U^*$,
so $J$ is also $(\sqrt{\varepsilon}, d^2)$-super-regular (according to the definition for non-bipartite digraphs). By
Lemma 13, $J$ has a Hamilton cycle, which clearly corresponds to a perfect matching $New_U$ in $G_U$
with the desired property.
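
The way a Hamilton cycle of the auxiliary digraph translates back into a new matching can be illustrated on a toy example. The sketch below makes several assumptions for illustration: the 1-factor is stored as a successor map `succ`, the set `A` plays the role of the matched vertices on the predecessor side (where each matching edge starts), `hamilton_order` lists the vertices on the other side in the order of an assumed Hamilton cycle of the auxiliary digraph, and the names `rematch` and `cycle_of` are hypothetical. The code does not check that the new edges exist in the underlying graph.

```python
def rematch(succ, A, hamilton_order):
    """Replace the old matching edges by those read off from a Hamilton cycle.

    For each u in hamilton_order, f(u) is the first vertex of A reached from u
    along the old 1-factor; the new matching edge leaves f(u) and enters the
    next vertex of hamilton_order.
    """
    A = set(A)

    def f(u):
        while u not in A:
            u = succ[u]               # follow the OLD 1-factor
        return u

    new_succ = dict(succ)
    m = len(hamilton_order)
    for j, u in enumerate(hamilton_order):
        new_succ[f(u)] = hamilton_order[(j + 1) % m]
    return new_succ


def cycle_of(succ, start):
    """List the vertices on the cycle of the 1-factor through `start`."""
    cycle, v = [start], succ[start]
    while v != start:
        cycle.append(v)
        v = succ[v]
    return cycle
```

For instance, starting from the three cycles `b0 -> x -> a0 -> b0`, `b1 -> a1 -> b1` and `b2 -> a2 -> b2` with `A = {a0, a1, a2}` and `hamilton_order = [b0, b1, b2]`, the new 1-factor is the single cycle `b0 -> x -> a0 -> b1 -> a1 -> b2 -> a2 -> b0`.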

Now we apply (†) to every cluster $U$ sequentially. We continue to denote the resulting 1-factor
by $\mathcal{C}$ and we write $C_U$ for the cycle that now contains all vertices in $U \setminus U_{Entry}$. Since $U_{Entry}$
and $U_{Exit}$ have size at most $m/4$ (say) for any $U$, we have $V(C_U) \cap V(C_{U^+}) \ne \emptyset$, so $C_U = C_{U^+}$.
Then $C_{U^-} = C_U = C_{U^+}$, and since $U_{Entry} \cap U_{Exit} = \emptyset$, we deduce that $C_U$ actually contains all
vertices of $U$. Then $C_U = C_{U^+}$ implies that $C_U$ contains all vertices lying in clusters belonging
to the cycle of $F$ containing $U$.

We now claim that $\mathcal{C}$ is in fact a Hamilton cycle. For this, recall that $W(Y_r, X_1^+)$ used every
cluster. Write $W(Y_r, X_1^+) = U_1C_1U_1^-U_2C_2U_2^- \dots U_tC_tU_t^-U_{t+1}$, where each cluster appears at
least once in $U_1, \dots, U_{t+1}$. Let $u_i^-u_{i+1}$ be the edge that we have chosen for the edge $U_i^-U_{i+1}$
on $W(Y_r, X_1^+)$. Note that for each $i = 1, \dots, t$ the vertices $u_{i+1}$ and $u_i^-$ lie on a common cycle
of $\mathcal{C}$, as this holds by construction of $\mathcal{C}$, whatever matchings we use to create $\mathcal{C}$. Since $u_i$ and $u_i^-$
also lie on a common cycle (their clusters $U_i$ and $U_i^-$ belong to the same cycle $C_i$ of $F$), this
means that all of $u_1, \dots, u_t$ (and thus also $u_{t+1}$) lie on the same
cycle $C$ of $\mathcal{C}$, which completes the proof.

8. Structure II: Shifted components, transitions and the exceptional set 

Having illustrated our techniques in the case when $H$ is strongly $\eta k$-connected, we now turn
to the case when this does not hold. In this section we impose further structure on $G$ by
introducing 'shifted components' of $H$ and various matchings linking these components and the
vertices of the exceptional set $V_0$. In the first subsection we construct the shifted components.
We describe some of their properties in the second subsection. The third subsection describes
a process by which our shifted walk $W$ will make transitions between the shifted components.
In the fourth subsection we partition $V_0$ into 4 parts according to the existence of certain
matchings between $V_0$ and the remainder of the digraph. Then we complete the description of
the transitions in the fifth subsection. Since we need to introduce a large amount of notation
in this section, we conclude with a summary of the important points.

We recall that $\varepsilon \ll d \ll \gamma \ll d' \ll \eta \ll \eta' \ll \beta$ and $|V_0| \le 11d^{1/2}n$.

8.1. Shifted components of $H$. Note that the in- and outdegrees of $H$ are obtained by
permuting those of $R_{G''}$, so $H$ has the same in- and outdegree sequences as $R_{G''}$, and the
bounds in Lemma 21 also apply to $H$. We start by establishing an expansion property for
subsets of $V(H)$.

Lemma 23. If $X \subseteq V(H)$ with $|X| \le (1 - \beta)k/2$ then

$$|N_H^{\pm}(X)| \ge |X| + \frac{\beta}{2}k - 5d'k - 1 \ge |X| + \frac{\beta}{4}k.$$

Proof. The argument is similar to that for Lemma 17. By symmetry it suffices to obtain the
bound for $|N_H^+(X)|$. Suppose for a contradiction that $|N_H^+(X)| < |X| + \frac{\beta}{2}k - 5d'k - 1$. By
Lemma 21(iii) we have $|X| > 5d'k + 1$. Also $d^+_{|X|-5d'k-1}(H) \le |N_H^+(X)| < (1/2 - 5d')k$, so
by Lemma 21(v) we have $d^-_{(1-\beta/2)k-|X|+5d'k+1}(H) \ge k - |X| + 1$. Then $H$ contains at least
$|X| + \beta k/2 - 5d'k$ vertices of indegree at least $k - |X| + 1$, and these all belong to $N_H^+(X)$, a
contradiction. □

We are assuming that $H$ is not strongly $\eta k$-connected, so we can choose a separator $S$ of $H$ of
size $|S| < \eta k$. Thus we have a partition of the vertices of $H$ into sets $S$, $C$, and $D$ such that
$H \setminus S$ does not contain an edge from $C$ to $D$ (although it might contain edges from $D$ to $C$).

Lemma 24. $|C|, |D| = k/2 \pm 2\eta k$.

Proof. Suppose for a contradiction that $|D| < k/2 - 2\eta k$. If the stronger inequality $|D| \le 
(1 - \beta)k/2$ holds then Lemma 23 implies that $|N^-_H(D)| \ge |D| + \frac{\beta}{4}k > |D| + |S|$, a contradiction. 
So we may assume that $|D| \ge (1 - \beta)k/2$. Let $D'$ be a subset of $D$ of size $(1 - \beta)k/2$. Now the 
first inequality of Lemma 23 implies that 

$$|N^-_H(D)| \ge |N^-_H(D')| \ge k/2 - 5d'k - 1 > (|D| + 2\eta k) - 5d'k - 1 > |D| + \eta k \ge |D| + |S|,$$

a contradiction. The bound $|C| \ge k/2 - 2\eta k$ is obtained in a similar way, which proves the 
lemma. □ 

Let $C_{\mathrm{small}}$ be the set of vertices in $C$ which (in the digraph $H$) have at most $\beta k/10$ 
inneighbours in $C$. Let $D_{\mathrm{small}}$ be the set of vertices in $D$ which (in the digraph $H$) have at most 
$\beta k/10$ outneighbours in $D$. 

Lemma 25. $|C_{\mathrm{small}}|, |D_{\mathrm{small}}| \le 8\eta k$. 

Proof. Let $C_{\mathrm{big}}$ be the set of vertices in $C$ which have at least $k/2 - \eta k$ outneighbours in $H$. We 
claim that $|C_{\mathrm{big}}| \ge \beta k/5$. To see this, first note that Lemma 24 and the fact that there are no 
edges from $C$ to $D$ imply that $D$ contains no vertex of indegree greater than $|D| + |S| \le k/2 + 3\eta k$. 
So again by Lemma 24 the number of vertices of indegree greater than $k/2 + 3\eta k$ in $H$ is at 
most $k/2 + 3\eta k$, which gives $d^-_{k/2 - 3\eta k} \le k/2 + 3\eta k$. Now Lemma 21(v) with $i = k/2 - \beta k/4$ says 
that $d^+_{k/2 - \beta k/4} \ge (1/2 - 5d')k$ or $d^-_{k/2 - \beta k/4} \ge k/2 + \beta k/4 - 5d'k$. The latter option cannot hold, 
as it would contradict our previous inequality for $d^-_{k/2 - 3\eta k}$. So the former option holds, and $H$ 
has at least $k/2 + \beta k/4$ vertices of outdegree at least $k/2 - 5d'k \ge k/2 - \eta k$. By Lemma 24, $C$ 
has to contain at least $\beta k/5$ of these vertices of high outdegree, which proves the claim. 

Now note that yet another application of Lemma 24 shows that every vertex in $C_{\mathrm{big}}$ has at 
least $k/2 - \eta k - |S| \ge |C| - 4\eta k$ outneighbours in $H[C]$. Suppose that $|C_{\mathrm{small}}| > 8\eta k$. Then 
every vertex in $C_{\mathrm{big}}$ has more than half of the vertices of $C_{\mathrm{small}}$ as outneighbours. This in turn 
implies that there is a vertex in $C_{\mathrm{small}}$ with more than half the vertices in $C_{\mathrm{big}}$ as inneighbours. 
In particular, it has more than $\beta k/10$ inneighbours in $C$. This contradicts the definition of 
$C_{\mathrm{small}}$, so in fact $|C_{\mathrm{small}}| \le 8\eta k$. The argument for $D_{\mathrm{small}}$ is similar. □ 

Let $C' := C \setminus C_{\mathrm{small}}$ and $D' := D \setminus D_{\mathrm{small}}$. 

Lemma 26. $H[C']$ and $H[D']$ are strongly $\eta'k$-connected. 

Proof. By symmetry it suffices to consider $H[C']$. The definition of $C_{\mathrm{small}}$ and Lemma 25 give 
$\delta^-(H[C']) \ge \beta k/10 - |C_{\mathrm{small}}| \ge \beta k/11$. Suppose that $H[C']$ is not strongly $\eta'k$-connected. Then 
there is a separator $T$ of size at most $\eta'k$ and a partition $U, W$ of $C' \setminus T$ such that $H[C'] \setminus T$ 
contains no edge from $U$ to $W$. Note that $|W| \ge \delta^-(H[C']) - |T| \ge \beta k/12$. So 

$$|U| \le |C'| - |W| \le (k/2 + 2\eta k) - \beta k/12 \le k/2 - \beta k/13. \qquad (1)$$

If the stronger inequality $|U| \le (1 - \beta)k/2$ holds then Lemma 23 implies that $|N^+_H(U)| \ge 
|U| + \beta k/4 > |U| + |S| + |T| + |C_{\mathrm{small}}|$, a contradiction. So we may assume that $|U| \ge (1 - \beta)k/2$. 
Let $U'$ be a subset of $U$ of size $(1 - \beta)k/2$. Now the first inequality in Lemma 23 implies that 

$$|N^+_H(U)| \ge |N^+_H(U')| \ge k/2 - 5d'k - 1 \ge (|U| + \beta k/13) - 5d'k - 1 > |U| + |S| + |T| + |C_{\mathrm{small}}|,$$

a contradiction again. □ 

Let $S'$ be the set obtained from $S$ by adding $C_{\mathrm{small}}$ and $D_{\mathrm{small}}$. So $|S'| \le 17\eta k$ and $S'$, $C'$, 
$D'$ is a vertex partition of $H$. 

Now let $L$ (for 'left') be the set obtained from $C'$ by adding all those vertices $v$ from $S'$ which 
satisfy $|N^+_H(v) \cap C'| \ge \eta'k$ and $|N^-_H(v) \cap C'| \ge \eta'k$. Next, let $R$ (for 'right') be the set obtained 
from $D'$ by adding all those remaining vertices $v$ from $S'$ which satisfy $|N^+_H(v) \cap D'| \ge \eta'k$ 
and $|N^-_H(v) \cap D'| \ge \eta'k$. Then $H[L]$ and $H[R]$ are both still $\eta'k$-connected. We write $M_V$ (for 
'vertical middle') for the remaining vertices in $S'$ (i.e. those which were not added to $C'$ or $D'$). 
Then $|M_V| \le |S'| \le 17\eta k$. Moreover, $L$, $M_V$ and $R$ partition the vertex set of $R_{G''}$. 

We also define another partition of $V(R_{G''})$ into three sets which we call $T$, $M_H$ and $B$ (for 
'top', 'horizontal middle', and 'bottom') as follows: 

• a cluster belongs to $T$ if and only if its successor in $F$ belongs to $L$; 

• a cluster belongs to $M_H$ if and only if its successor in $F$ belongs to $M_V$; 

• a cluster belongs to $B$ if and only if its successor in $F$ belongs to $R$. 
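
The three rules above amount to pulling the partition $L$, $M_V$, $R$ back along the successor map of $F$. A minimal sketch, assuming $F$ is stored as a dictionary sending each cluster to its successor; the name `shifted_partition` and the toy data are hypothetical.

```python
def shifted_partition(successor, L, M_V, R):
    """Split the clusters into T, M_H, B according to where their successor in F lies."""
    T, M_H, B = set(), set(), set()
    for cluster, succ in successor.items():
        if succ in L:
            T.add(cluster)
        elif succ in M_V:
            M_H.add(cluster)
        elif succ in R:
            B.add(cluster)
        else:
            raise ValueError(f"successor {succ!r} is not in L, M_V or R")
    return T, M_H, B


# Hypothetical toy 1-factor: a single cycle 0 -> 1 -> ... -> 5 -> 0.
toy_successor = {i: (i + 1) % 6 for i in range(6)}
toy_T, toy_MH, toy_B = shifted_partition(toy_successor, L={0, 1, 2}, M_V={3}, R={4, 5})
```

Since the successor map is a bijection on clusters, the three parts have the same sizes as $L$, $M_V$ and $R$ respectively.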
Figure 1. Shifted components and the exceptional set 

The general picture (including a partition of $V_0$ defined below) is illustrated in Figure 1. For 
each of the above subsets of $V(H) = V(R_{G''})$ we use a 'tilde' notation to denote the subset of 
$V(G)$ consisting of the union of the corresponding clusters, thus $\tilde{L} = \bigcup_{U \in L} U \subseteq V(G)$, etc. Note 
that 

$$|M_H| = |M_V| \le 17\eta k. \qquad (2)$$
We need to remove certain cycles from $F$ that would create difficulties later on. Let $M := 
M_V \cup M_H$. We say that a cycle $C$ of $F$ significantly intersects $M$ if $|C \cap M| \ge |C|/10$. If we 
have $|\tilde{M}| \le |V_0|/\gamma^3$ then we remove all cycles that significantly intersect $M$ from $F$ and add all 
vertices in their clusters to the exceptional set. Since $d \ll \gamma$ we still have the inequality 

$$|V_0| \le 11d^{1/2}n + 10 \cdot 11d^{1/2}n/\gamma^3 \le d^{1/4}n. \qquad (3)$$

Later we will distinguish the following two cases according to the size of $M$. 
(★) $|\tilde{M}| \le |V_0|/\gamma^3$. Moreover, no cycle of $F$ significantly intersects $M$. 
(★★) $|\tilde{M}| \ge |V_0|/\gamma^3 > 0$. 

(The proof would be considerably simpler if we could remove all the cycles which significantly 
intersect/lie in $M$ in the case (★★) but this would make $|V_0|$ too large.) Since any cycle in $F$ 
has equal intersection sizes with $M_H$ and $M_V$ we still have $|\tilde{M}_H| = |\tilde{M}_V|$ of size at most $17\eta n$. 
We still denote the remaining subset of $R$ by $R$, and similarly for all the other sets $B$, $L$, $M_V$ 
etc. 

8.2. Properties of the shifted components. We start by justifying the name 'shifted com- 
ponents'. The following lemma shows that we have decomposed most of the digraph into two 
pieces of roughly equal size, where in each piece we have the high connectivity that enabled us 
to establish the result in the previous section. 

Lemma 27. 

(i) $H[L]$ and $H[R]$ are strongly $\eta'k/2$-connected. 

(ii) $|\tilde{L}|, |\tilde{R}|, |\tilde{B}|, |\tilde{T}| = n/2 \pm 19\eta n$. 

Proof. To prove (i), recall that before the removal of the cycles we knew that $H[L]$ and $H[R]$ 
were strongly $\eta'k$-connected. By (3), the number of clusters removed in case (★) is at most 
$d^{1/4}n/m \le \eta'k/2$. Since we only removed entire $F$-cycles, we did not delete any edges from $H$ 
other than those incident to the clusters that were deleted. Thus for each cluster removed the 
connectivity only decreases by at most one. 

For (ii), we recall that $|C| = k/2 \pm 2\eta k$ (Lemma 24) and $L$ was obtained from $C$ by removing 
$|C_{\mathrm{small}}| \le 8\eta k$ clusters, adding at most $|S'| \le 17\eta k$ clusters and removing at most $d^{1/4}n/m \le \eta k$ 
clusters. The argument for $R$ is the same. The other two bounds follow since $|B| = |R|$ and 
$|T| = |L|$. □ 
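
The bookkeeping behind (ii) can be written out as follows. This is a sketch of the arithmetic only, assuming (as the setup suggests) that every cluster has exactly $m$ vertices and that $km = n - |V_0|$ with $|V_0| \le d^{1/4}n \le \eta n$; these two assumptions are ours and are not stated in this section.

```latex
\begin{align*}
|L| &\le |C| + |S'| \le \Bigl(\tfrac{k}{2} + 2\eta k\Bigr) + 17\eta k = \tfrac{k}{2} + 19\eta k,\\
|L| &\ge |C| - |C_{\mathrm{small}}| - \frac{d^{1/4}n}{m}
      \ge \Bigl(\tfrac{k}{2} - 2\eta k\Bigr) - 8\eta k - \eta k = \tfrac{k}{2} - 11\eta k,\\
|\tilde{L}| &= m|L| \le \tfrac{km}{2} + 19\eta km \le \tfrac{n}{2} + 19\eta n,\\
|\tilde{L}| &= m|L| \ge \tfrac{n - |V_0|}{2} - 11\eta n \ge \tfrac{n}{2} - 19\eta n.
\end{align*}
```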

Next we define a partition of $M_V$ into $M_V^{LR}$ and $M_V^{RL}$ as follows. A cluster $X \in M_V$ belongs 
to $M_V^{LR}$ if $|N^+_H(X) \cap C'| < \eta'k$ and $|N^-_H(X) \cap D'| < \eta'k$. A cluster $X \in M_V$ belongs to $M_V^{RL}$ 
if $|N^+_H(X) \cap D'| < \eta'k$ and $|N^-_H(X) \cap C'| < \eta'k$. The definition of $L$ and $R$ and the fact that 
$H$ has minimum semidegree at least $\beta k/2$ imply that this is indeed a partition of $M_V$. Since 
$|M_V| \le 17\eta k$ and $\delta^0(H) \ge \beta k/2$ we have the following properties. 

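Lemma 28. 

(i) All $V \in M_V^{LR}$ satisfy $|N^+_H(V) \cap L|, |N^-_H(V) \cap R| < 2\eta'k$ and $|N^+_H(V) \cap R|, |N^-_H(V) \cap L| > 
\beta k/3$. 

(ii) All $V \in M_V^{RL}$ satisfy $|N^+_H(V) \cap R|, |N^-_H(V) \cap L| < 2\eta'k$ and $|N^+_H(V) \cap L|, |N^-_H(V) \cap R| > 
\beta k/3$. 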

Let $M_H^{LR}$ be the set of clusters whose successor in $F$ belongs to $M_V^{LR}$ and define $M_H^{RL}$ similarly. 
Note that this yields a partition of $M_H$. 

It will be helpful later to note that if $M_V^{LR} \ne \emptyset$ then we can use clusters in $M_V^{LR}$ to obtain 
shifted walks from $L$ to $R$. Similarly, any clusters in $M_V^{RL}$ can be used to obtain shifted walks 
from $R$ to $L$. This will use the following lemma. 

Lemma 29. 

(i) For all $x \in \tilde{M}_H^{LR}$, we have $|N^+_G(x) \cap \tilde{L}| \le 3\eta'n$ and $|N^+_G(x) \cap \tilde{R}| \ge \beta n/2$. Also, at most 
$12\eta'n$ vertices in $\tilde{L}$ have more than $|\tilde{M}_H^{LR}|/4$ inneighbours in $\tilde{M}_H^{LR}$. 

(ii) For all $x \in \tilde{M}_V^{LR}$, we have $|N^-_G(x) \cap \tilde{B}| \le 3\eta'n$ and $|N^-_G(x) \cap \tilde{T}| \ge \beta n/2$. Also, at most 
$12\eta'n$ vertices in $\tilde{B}$ have more than $|\tilde{M}_V^{LR}|/4$ outneighbours in $\tilde{M}_V^{LR}$. 

(iii) For all $x \in \tilde{M}_H^{RL}$, we have $|N^+_G(x) \cap \tilde{R}| \le 3\eta'n$ and $|N^+_G(x) \cap \tilde{L}| \ge \beta n/2$. Also, at most 
$12\eta'n$ vertices in $\tilde{R}$ have more than $|\tilde{M}_H^{RL}|/4$ inneighbours in $\tilde{M}_H^{RL}$. 

(iv) For all $x \in \tilde{M}_V^{RL}$, we have $|N^-_G(x) \cap \tilde{T}| \le 3\eta'n$ and $|N^-_G(x) \cap \tilde{B}| \ge \beta n/2$. Also, at most 
$12\eta'n$ vertices in $\tilde{T}$ have more than $|\tilde{M}_V^{RL}|/4$ outneighbours in $\tilde{M}_V^{RL}$. 

Proof. For (i), suppose $x \in \tilde{M}_H^{LR}$ satisfies $|N^+_G(x) \cap \tilde{L}| \ge 3\eta'n$. Note that Lemma 21 implies 
that $x$ is typical. Using the definition of 'typical' and accounting for vertices added to $V_0$ we 
still have $|N^+_{G''}(x) \cap \tilde{L}| \ge 2\eta'n$. Then the cluster $U$ containing $x$ must have (in $R_{G''}$) at least 
$2\eta'k$ outneighbours in $L$. The definition of $M_H^{LR}$ implies that the successor $U^+$ of $U$ lies in 
$M_V^{LR}$. Then $|N^+_H(U^+) \cap L| = |N^+_{R_{G''}}(U) \cap L| \ge 2\eta'k$, contradicting Lemma 28(i). We deduce 
that $|N^+_G(x) \cap \tilde{L}| \le 3\eta'n$. It follows that there are at most $3\eta'n|\tilde{M}_H^{LR}|$ edges from $\tilde{M}_H^{LR}$ to $\tilde{L}$, so 
the final assertion of (i) holds. For the second bound in (i), we note that 

$$|N^+_G(x) \cap \tilde{R}| \ge \delta^+(G) - |N^+_G(x) \cap \tilde{L}| - |\tilde{M}_V| - |V_0| \ge \beta n - 3\eta'n - 17\eta n - d^{1/4}n \ge \beta n/2.$$

For (ii), suppose $x \in \tilde{M}_V^{LR}$ satisfies $|N^-_G(x) \cap \tilde{B}| \ge 3\eta'n$. Then the cluster $U \in M_V^{LR}$ containing $x$ 
must have (in $R_{G''}$) at least $2\eta'k$ inneighbours in $B$. Thus in $H$ it has at least $2\eta'k$ inneighbours 
in $R$, contradicting Lemma 28(i). The remainder of (ii) follows as for (i). The proof of (iii) is 
very similar to that of (i) and the proof of (iv) to that of (ii). □ 

If $X$ and $Y$ are clusters in $L$, then there are many shifted walks (with respect to $R_{G''}$ and $F$) 
from $X$ to $Y$. Later we will need that paths corresponding to such walks can be found in $G$, 
even if a large number of vertices in clusters lying on these paths have already been used for 
other purposes. This will follow from the following lemma. 

Lemma 30. Suppose $U$ is a cluster, $u \in U$ and write $s = \eta'k/4$. 

(i) If $U \in R \cup M_V^{RL}$ then there are clusters $V_1, \dots, V_s \in B$ such that $V_iU \in E(R_{G''})$ and $u$ 
has at least $d'm/4$ inneighbours in $V_i$ for $1 \le i \le s$. 

(ii) If $U \in T \cup M_H^{RL}$ then there are clusters $V_1, \dots, V_s \in L$ such that $UV_i \in E(R_{G''})$ and $u$ 
has at least $d'm/4$ outneighbours in $V_i$ for $1 \le i \le s$. 

(iii) If $U \in L \cup M_V^{LR}$ then there are clusters $V_1, \dots, V_s \in T$ such that $V_iU \in E(R_{G''})$ and $u$ 
has at least $d'm/4$ inneighbours in $V_i$ for $1 \le i \le s$. 

(iv) If $U \in B \cup M_H^{LR}$ then there are clusters $V_1, \dots, V_s \in R$ such that $UV_i \in E(R_{G''})$ and $u$ 
has at least $d'm/4$ outneighbours in $V_i$ for $1 \le i \le s$. 

Proof. To prove (i) recall from Lemma 27 that $H[R]$ is strongly $\eta'k/2$-connected, and so has 
minimum indegree at least $\eta'k/2$. Thus any $U \in R$ has inneighbours $V_1, \dots, V_{2s}$ in $R_{G''}$ such 
that $V_i \in B$. This also holds for $U \in M_V^{RL}$ by Lemma 28(ii), since $\beta \gg \eta'$. In both cases we 
remove all the $V_i$ for which $u$ does not have at least $d'm/4$ inneighbours in $V_i$. Then, since $u$ 
is typical (this was defined before Lemma 21), we are left with $2s - \varepsilon^{1/2}k \ge s$ clusters where $u$ 
has at least $d'm/4$ inneighbours. Statements (ii)-(iv) are proved similarly. □ 

8.3. Transitions. As in the highly connected case, our general strategy is to find a suitable 
shifted walk W and transform it into a Hamilton cycle. We will be able to move easily within 
L, and also within R, using the same arguments as in the highly connected case. However, we 
need other methods to move between L and R, which we will now discuss. To avoid excessive 
notation we will just describe how to move from R to L, as our arguments will be symmetric 
under the exchange $R \leftrightarrow L$ (and so $B \leftrightarrow T$). To move from $R$ to $L$ we use two types of 
'transitions' from $B$ to $L$. The first of these is a set of edges $\mathrm{Match}_{BL}$ from $B$ to $L$, which will 
'almost' be a matching, and will have certain desirable properties defined as follows. 

Given matchings Match' and Match'' in $G$ from $B$ to $L$, we call a cluster $V$ full (with respect 
to Match' ∪ Match'') if it contains at least $\gamma m$ endvertices of edges in Match' ∪ Match''. Given 
a number $\ell$, we say $V$ is $\ell$-fair (with respect to Match' ∪ Match'') if no cluster with distance at 
most $\ell$ from $V$ in $F$ is full. A cluster $V$ is $\ell$-excellent if it is $\ell$-fair and no cluster with distance 
at most $\ell$ from $V$ in $F$ lies in $M = M_V \cup M_H$ (the 'middle'). We call Match' ∪ Match'' a 
pseudo-matching from $B$ to $L$ if the following properties are satisfied: 

• Match' U Match" is a vertex-disjoint union of 'components', each of which is either a 
single edge or a directed path of length 2. 

• Every single edge component has at least one endvertex in a 4-excellent cluster, and 
every directed path of length 2 has both endvertices in 4-excellent clusters. 
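
The notions of full, fair and excellent clusters can be computed directly once $F$ and the matching edges are known. The following is a minimal sketch, assuming that 'distance in $F$' means the number of steps along a cycle of $F$ in either direction (the text does not spell this out), that $F$ is given as a successor dictionary, and that `cluster_of` maps each vertex to its cluster; all names and the toy data are hypothetical.

```python
from collections import Counter


def cycle_ball(cluster, successor, predecessor, radius):
    """Clusters within `radius` steps of `cluster` along its cycle of F (either direction)."""
    ball = {cluster}
    forward = backward = cluster
    for _ in range(radius):
        forward, backward = successor[forward], predecessor[backward]
        ball.update((forward, backward))
    return ball


def classify_clusters(successor, cluster_of, matching_edges, middle, gamma, m, ell):
    """Return the sets of full, ell-fair and ell-excellent clusters."""
    predecessor = {v: u for u, v in successor.items()}
    # Each endvertex is counted once, even if it lies on two matching edges.
    endvertices = {x for edge in matching_edges for x in edge}
    load = Counter(cluster_of[x] for x in endvertices)
    full = {c for c in successor if load[c] >= gamma * m}
    fair = {c for c in successor
            if not (cycle_ball(c, successor, predecessor, ell) & full)}
    excellent = {c for c in fair
                 if not (cycle_ball(c, successor, predecessor, ell) & middle)}
    return full, fair, excellent


# Toy data: F is a single cycle 0 -> 1 -> ... -> 7 -> 0 with m = 4 vertices per cluster.
toy_successor = {i: (i + 1) % 8 for i in range(8)}
toy_cluster_of = {(c, j): c for c in range(8) for j in range(4)}
toy_edges = [((0, 0), (3, 0)), ((0, 1), (5, 0))]
toy_full, toy_fair, toy_excellent = classify_clusters(
    toy_successor, toy_cluster_of, toy_edges, middle={4}, gamma=0.5, m=4, ell=1)
```

In this toy instance only cluster 0 meets the threshold $\gamma m = 2$, so the clusters at distance at most 1 from it lose fairness, and the fair clusters at distance at most 1 from the middle cluster 4 lose excellence.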

Given matchings Match' and Match" from T to R, we say that Match' U Match" is a pseudo- 
matching from T to R if it satisfies the analogous properties. As we shall see later, each edge of 
a pseudo-matching from B to L allows us to move from R to L. Note that this applies even to 
the two edges in any directed paths of length 2: these will enable us to move twice from R to L, 
using the rerouting procedure described later. Similarly, each edge of a pseudo-matching from 
T to R allows us to move from L to R. We consider pseudo-matchings rather than matchings 
because in general $B \cap L \ne \emptyset$, so the largest matching we can guarantee is only half as large as 
the largest pseudo-matching. This would not provide all the edges we need to move from R to 
L. 

We now choose pseudo-matchings $\mathrm{Match}_{BL}$ from $B$ to $L$ and $\mathrm{Match}_{TR}$ from $T$ to $R$, each of 
which is maximal subject to the condition 

• $|\mathrm{Match}_{BL}|, |\mathrm{Match}_{TR}| \le \gamma^2 n$. 

(Here $|\mathrm{Match}_{BL}|$ denotes the number of edges in $\mathrm{Match}_{BL}$.) Note that $\mathrm{Match}_{BL}$ and $\mathrm{Match}_{TR}$ 
may have common vertices. Recalling that $|M| \le 34\eta k$ by (2), 

at most $2|\mathrm{Match}_{BL}|/\gamma m \le 3\gamma k$ clusters are full with respect to $\mathrm{Match}_{BL}$, and at most 
$11(3\gamma k + |M|) \le 400\eta k$ clusters are not 5-excellent with respect to $\mathrm{Match}_{BL}$. A similar 
statement holds for $\mathrm{Match}_{TR}$. (♯) 

From now on, whenever we refer to a fair or excellent cluster it will be with respect to the 
pseudo-matching $\mathrm{Match}_{BL}$. 

As in the highly connected case, we will identify 'entries' and 'exits' for edges of the cycle 
that do not lie in a pair corresponding to an edge of $F$. For $\mathrm{Match}_{BL}$, the exits is the set $\mathrm{exit}_{BL}$ 
of all initial vertices of edges in $\mathrm{Match}_{BL}$, and the entries is the set $\mathrm{entry}_{BL}$ of all final vertices 
of edges in $\mathrm{Match}_{BL}$. (We will define further exits and entries in due course.) 

Figure 2. Partitions avoiding interference between exits/entries and $V_0$. 

At this stage, we do not know how many of the matching edges we actually will need in $W$, 
as this depends on a partition of the exceptional set $V_0$ to be defined in the next subsection. 
So, given a cluster $V$, we want to ensure if e.g. we only use some of the vertices in $V \cap \mathrm{exit}_{BL}$, 
then the unused remainder of $V$ and $V^+$ still forms a super-regular pair. We may not be able 
to achieve this for any $V$, but if $V$ is 2-fair, we know that none of $V^-$, $V$, $V^+$ is full, which 
gives us the flexibility we need. We say that a cluster $V$ is nearly 2-fair if $V$ is either 2-fair or 
at distance 1 on $F$ from a 2-fair cluster. In the following lemma we choose partitions of the 
nearly 2-fair clusters which allow us to avoid any 'interference' between exits/entries and the 
exceptional set. Figure 2 illustrates these partitions, and also some additional sets that will be 
defined in Subsection 8.5 ('twins' of exits/entries and an ideal to preserve super-regularity). 

We define our partitions of the nearly 2-fair clusters as follows. For every 2-fair cluster $V$ with 
$V \cap \mathrm{exit}_{BL} \ne \emptyset$ we will choose a partition $V_{\mathrm{ex},1}, V_{\mathrm{ex},2}$ of $V$ and a partition $V^+_{\mathrm{ent},1}, V^+_{\mathrm{ent},2}$ of $V^+$. 
Also, for every 2-fair cluster $V$ with $V \cap \mathrm{entry}_{BL} \ne \emptyset$ we choose a partition $V_{\mathrm{ent},1}, V_{\mathrm{ent},2}$ of $V$ 
and a partition $V^-_{\mathrm{ex},1}, V^-_{\mathrm{ex},2}$ of $V^-$. There is no conflict in our notation, i.e. we will not e.g. define 
$V_{\mathrm{ex},1}$ twice, since when $V \cap \mathrm{exit}_{BL} \ne \emptyset$ we must have $V \in B$, whereas when $V^+ \cap \mathrm{entry}_{BL} \ne \emptyset$ 
we must have $V^+ \in L$, so $V \in T$, and these cannot occur simultaneously. We also define the 
analogous partitions with respect to $\mathrm{Match}_{TR}$, although for simplicity we will not explicitly 
introduce notation for them, as we will mainly focus on the case when only $\mathrm{Match}_{BL}$ is needed 
for the argument. So for each cluster $V$ we will choose at most 4 partitions. We let $V_{2\mathrm{nd}}$ be 
the intersection of all the second parts of the at most 4 partitions defined for $V$. (So if all 4 
partitions are defined, then $V_{2\mathrm{nd}}$ is the intersection of the sets $V_{\mathrm{ex},2}, V_{\mathrm{ent},2}$ defined with respect 
to $\mathrm{Match}_{BL}$ and the 2 analogous sets defined with respect to $\mathrm{Match}_{TR}$. If only 3 partitions 
are defined for $V$, then $V_{2\mathrm{nd}}$ is the intersection of only 3 sets etc. If no partition is defined for 
$V$, then $V_{2\mathrm{nd}} = V$.) We let $X_{2\mathrm{nd}}$ be the union of $V_{2\mathrm{nd}}$ over all clusters $V$. We choose these 
partitions to satisfy the following lemma. 
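Lemma 31. The partitions $V_{\mathrm{ex},1}, V_{\mathrm{ex},2}$ and $V_{\mathrm{ent},1}, V_{\mathrm{ent},2}$ can be chosen with the following 
properties (when they are defined). 

(i) $|V_{\mathrm{ex},1}| = m/2$, $|V_{\mathrm{ent},1}| = m/2$. 

(ii) For any 2-fair cluster $V$ with $V \cap \mathrm{exit}_{BL} \ne \emptyset$ we have $V_{\mathrm{ex},2} \cap \mathrm{exit}_{BL} = \emptyset$. Moreover, 
there is a set $V^+_{\mathrm{ent},0} \subseteq V^+_{\mathrm{ent},1}$ of size at most $10\varepsilon m$ such that: 

• Each vertex in $V^+ \setminus V^+_{\mathrm{ent},0}$ has at least $dm/40$ inneighbours in $V_{\mathrm{ex},1} \setminus \mathrm{exit}_{BL}$. 

• Each vertex in $V^+_{\mathrm{ent},0}$ has at least $dm/8$ inneighbours in $V_{\mathrm{ex},1}$. 

• Each vertex in $V$ has at least $dm/20$ outneighbours in $V^+_{\mathrm{ent},1}$. 

(iii) For any 2-fair cluster $V$ with $V \cap \mathrm{entry}_{BL} \ne \emptyset$ we have $V_{\mathrm{ent},2} \cap \mathrm{entry}_{BL} = \emptyset$. Moreover, 
there is a set $V^-_{\mathrm{ex},0} \subseteq V^-_{\mathrm{ex},1}$ of size at most $10\varepsilon m$ such that: 

• Each vertex in $V^- \setminus V^-_{\mathrm{ex},0}$ has at least $dm/40$ outneighbours in $V_{\mathrm{ent},1} \setminus \mathrm{entry}_{BL}$. 

• Each vertex in $V^-_{\mathrm{ex},0}$ has at least $dm/8$ outneighbours in $V_{\mathrm{ent},1}$. 

• Each vertex in $V$ has at least $dm/20$ inneighbours in $V^-_{\mathrm{ex},1}$. 

Also, the analogues of statements (i)-(iii) for $\mathrm{Match}_{TR}$ hold. Moreover, 

(iv) Every vertex in $V_0$ has at least $\beta n/20$ inneighbours and at least $\beta n/20$ outneighbours in 
$X_{2\mathrm{nd}}$. 

(v) If $d^+_{(1-\beta)n/2}(G) \ge n/2$ then there are sets $S'_B \subseteq \tilde{B} \cap X_{2\mathrm{nd}}$ and $S'_T \subseteq \tilde{T} \cap X_{2\mathrm{nd}}$ such that 
$|S'_B|, |S'_T| \ge \beta n/80$ and such that every vertex in $S'_B \cup S'_T$ has outdegree at least $n/2$ 
in $G$. 

(vi) If $d^-_{(1-\beta)n/2}(G) \ge n/2$ then there are sets $S'_L \subseteq \tilde{L} \cap X_{2\mathrm{nd}}$ and $S'_R \subseteq \tilde{R} \cap X_{2\mathrm{nd}}$ such that 
$|S'_L|, |S'_R| \ge \beta n/80$ and such that every vertex in $S'_L \cup S'_R$ has indegree at least $n/2$ in $G$. 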

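Proof. Consider a 2-fair cluster $V$ with $V \cap \mathrm{exit}_{BL} \ne \emptyset$. If $|V \cap \mathrm{exit}_{BL}| < 20\varepsilon m$ we set $V^+_{\mathrm{ent},0} = \emptyset$. 
Otherwise, if $|V \cap \mathrm{exit}_{BL}| \ge 20\varepsilon m$ we define $V^+_{\mathrm{ent},0}$ to be the set of vertices in $V^+$ that have less 
than $\frac{d}{8}|V \setminus \mathrm{exit}_{BL}|$ inneighbours in $V \setminus \mathrm{exit}_{BL}$. Recall that $(V, V^+)_{G'}$ is $(10\varepsilon, d/4)$-super-regular. 
Since $V$ is 2-fair we deduce that $|V^+_{\mathrm{ent},0}| \le 10\varepsilon m$. 

Now consider constructing a partition of $V$ into $V_{\mathrm{ex},1}$ and $V_{\mathrm{ex},2}$ as follows. Include $V \cap \mathrm{exit}_{BL}$ 
into $V_{\mathrm{ex},1}$ and distribute the remaining vertices of $V$ between $V_{\mathrm{ex},1}$ and $V_{\mathrm{ex},2}$ so that $|V_{\mathrm{ex},1}| = m/2$ 
(recall that $m$ is even), choosing uniformly at random between all possibilities. Note that since 
$V$ is 2-fair the probability that a vertex of $V \setminus \mathrm{exit}_{BL}$ is included in $V_{\mathrm{ex},1}$ is at least 1/3. 
Then by the Chernoff bound for the hypergeometric distribution (Proposition 9), with high 
probability each vertex in $V^+ \setminus V^+_{\mathrm{ent},0}$ has at least $\frac{1}{4} \cdot \frac{d}{8}|V \setminus \mathrm{exit}_{BL}| \ge dm/40$ inneighbours in 
$V_{\mathrm{ex},1} \setminus \mathrm{exit}_{BL}$. Also, by definition of $V^+_{\mathrm{ent},0}$ and super-regularity, each vertex in $V^+_{\mathrm{ent},0}$ has at 
least $dm/4 - \frac{d}{8}|V \setminus \mathrm{exit}_{BL}| \ge dm/8$ inneighbours in $V_{\mathrm{ex},1}$. Next, consider similarly constructing 
a partition of $V^+$ into $V^+_{\mathrm{ent},1}$ and $V^+_{\mathrm{ent},2}$ as follows. Include $V^+_{\mathrm{ent},0}$ into $V^+_{\mathrm{ent},1}$ and distribute the 
remaining vertices of $V^+$ uniformly at random between $V^+_{\mathrm{ent},1}$ and $V^+_{\mathrm{ent},2}$ so that $|V^+_{\mathrm{ent},1}| = m/2$. 
Note that any vertex in $V$ has outdegree at least $dm/4 - |V^+_{\mathrm{ent},0}| \ge dm/5$ in $V^+ \setminus V^+_{\mathrm{ent},0}$. Again, 
the probability that a given vertex from $V^+ \setminus V^+_{\mathrm{ent},0}$ is included in $V^+_{\mathrm{ent},1}$ is at least 1/3, so with 
high probability each vertex in $V$ has at least $(dm/5)/4 = dm/20$ outneighbours in $V^+_{\mathrm{ent},1}$. This 
shows the existence of the partitions required for (ii). The existence of partitions satisfying (iii) 
is proven in the same way. 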

For each vertex $u$ in a cluster $V$ which does not lie in $\mathrm{exit}_{BL} \cup V_{\mathrm{ent},0} \cup \mathrm{entry}_{BL} \cup V_{\mathrm{ex},0}$ or the 
analogous set defined with respect to $\mathrm{Match}_{TR}$, the probability that it lies in the second part 
of each of the (up to) 4 partitions defined on $V$ (and thus lies in $X_{2\mathrm{nd}}$) is at least $(1/2)^4$. Since 
$\delta^0(G) \ge \beta n$, a Chernoff bound (Proposition 9) implies that we can also choose the partitions 
to satisfy (iv). 

Now suppose that $d^+_{(1-\beta)n/2}(G) \ge n/2$. Then $G$ contains at least $(1 + \beta)n/2$ vertices of 
outdegree at least $n/2$. So Lemma 27(ii) implies that $\tilde{B}$ contains a set $B'_{\mathrm{large}}$ of at least $\beta n/3$ 
vertices whose outdegree in $G$ is at least $n/2$. $B'_{\mathrm{large}}$ in turn contains a set $B_{\mathrm{large}}$ of at least 
$\beta n/4$ vertices which do not lie in $\mathrm{exit}_{BL} \cup V_{\mathrm{ent},0} \cup \mathrm{entry}_{BL} \cup V_{\mathrm{ex},0}$ (for any cluster $V$) or in the 
analogous set defined with respect to $\mathrm{Match}_{TR}$. Similarly as for (iv), with high probability we 
have $|B_{\mathrm{large}} \cap X_{2\mathrm{nd}}| \ge (\beta n/4)/20$. Similar arguments applied to $\tilde{T}$, $\tilde{L}$ and $\tilde{R}$ show that we can 
choose the partitions to satisfy (v) and (vi). □ 

8.4. The exceptional set. Next we will assign each vertex $x$ in the exceptional set $V_0$ an 
in-type which is one of $T$ or $B$ and an out-type which is one of $L$ or $R$. Combining these two 
types together we will say each vertex of $V_0$ gets a type of the form $TR$, $TL$, $BR$ or $BL$. We 
will also abuse notation and think of $TL$ as the set of all vertices of $V_0$ of in-type $T$ and out-type 
$L$, etc. We write $T^*$ for the set of all those vertices which belong to the set $X_{2\mathrm{nd}}$ defined in 
the previous subsection as well as to clusters of $T$ which are 5-excellent with respect to both 
$\mathrm{Match}_{BL}$ and $\mathrm{Match}_{TR}$. The other sets $B^*$ etc. are defined similarly. 

Lemma 32. We can assign each vertex $x \in V_0$ an in- and an out-type such that the following 
conditions are satisfied. 

(i) There is a matching $\mathrm{Match}_T$ from $T^*$ to the set of vertices of in-type $T$. 

(ii) There is a matching $\mathrm{Match}_B$ from $B^*$ to the set of vertices of in-type $B$. 

(iii) There is a matching $\mathrm{Match}_L$ from the set of vertices of out-type $L$ to $L^*$. 

(iv) There is a matching $\mathrm{Match}_R$ from the set of vertices of out-type $R$ to $R^*$. 

(v) The endvertices of the matchings $\mathrm{Match}_T$, $\mathrm{Match}_B$, $\mathrm{Match}_L$, $\mathrm{Match}_R$ in $V(G) \setminus V_0$ are 
all distinct. Let $V_0^*$ denote the set of all these endvertices. 

(vi) No cluster of $R_{G'}$ contains more than $\gamma m$ vertices of $V_0^*$. 

(vii) Subject to the above conditions, $\bigl||TR| - |BL|\bigr|$ is minimal. 

Proof. To show that such a choice is possible, we claim that we can proceed sequentially through 
the vertices of $V_0$, assigning in-types and out-types and greedily extending the appropriate 
matchings. Since $|V_0| \le d^{1/4}n$ by (3), at any stage in this process we have constructed at most 
$2d^{1/4}n$ edges of the matchings $\mathrm{Match}_T$, $\mathrm{Match}_B$, $\mathrm{Match}_L$, $\mathrm{Match}_R$, and so there are at most 
$2d^{1/4}n/\gamma \le d^{1/5}n$ vertices belonging to clusters which contain at least $\gamma m$ endpoints of the 
matchings. In addition, we have to avoid all the at most $800\eta n$ vertices lying in clusters which 
are not 5-excellent with respect to both $\mathrm{Match}_{BL}$ and $\mathrm{Match}_{TR}$. So in total we have to avoid 
at most $801\eta n$ vertices in each step. But by Lemma 31(iv) each exceptional vertex has in- and 
outdegree at least $\beta n/20$ in $X_{2\mathrm{nd}}$, so Lemma 27(ii) implies that any vertex has at least $\beta n/50$ 
inneighbours in $\tilde{T} \cap X_{2\mathrm{nd}}$ or at least $\beta n/50$ inneighbours in $\tilde{B} \cap X_{2\mathrm{nd}}$. A similar statement holds 
for outneighbours in $\tilde{L} \cap X_{2\mathrm{nd}}$ or $\tilde{R} \cap X_{2\mathrm{nd}}$. Thus a greedy procedure can satisfy conditions 
(i)-(vi), and then we can choose an assignment to satisfy (vii). □ 

Note that one advantage of choosing $V_0^*$ in $X_{2\mathrm{nd}}$ is that $V_0^*$ will be disjoint from the sets 
$\mathrm{Entry}_{BL}$ etc. The strategy of the remaining proof depends on the value of $|TR| - |BL|$. We will 
only consider the case $|TR| - |BL| \ge 0$, as the argument for $|TR| - |BL| \le 0$ is identical, under 
the symmetry $L \leftrightarrow R$, $T \leftrightarrow B$. When $|TR| > |BL|$ only $\mathrm{Match}_{BL}$ is needed for the argument. 
When $|TR| = |BL|$ we do not need either pseudo-matching, although the case $|TR| = |BL| = 0$ 
has additional complications. 

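8.5. Twins. When $|TR| > |BL|$, we obtain one type of transitions from $R$ to $L$ by fixing a 
pseudo-matching $\mathrm{Match}'_{BL} \subseteq \mathrm{Match}_{BL}$. The other type of transitions uses a set $\mathrm{Entry}_{RL} \subseteq 
\tilde{M}_V^{RL}$, as explained below. We define exits $\mathrm{Exit}_{BL} \subseteq \mathrm{exit}_{BL}$ and entries $\mathrm{Entry}_{BL} \subseteq \mathrm{entry}_{BL}$ of 
$\mathrm{Match}'_{BL}$ as for $\mathrm{Match}_{BL}$. Lemma 34(i) below implies that $|\mathrm{Match}_{BL}| + |\tilde{M}_V^{RL}| \ge |TR| - |BL|$. 
Thus we can fix sets $\mathrm{Match}'_{BL}$ and $\mathrm{Entry}_{RL}$ to satisfy 

$$|\mathrm{Match}'_{BL}| + |\mathrm{Entry}_{RL}| = |TR| - |BL|.$$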
For each edge $xy \in \mathrm{Match}'_{BL}$ we will choose 'twins' $x^{\mathrm{twin}}$ and $y^{\mathrm{twin}}$ of its endpoints. To use 
the edge $xy$ in our shifted walk $W$, we will enter the cycle of $F$ containing $x$ at $x^{\mathrm{twin}}$, wind 
around the cycle to $x$, use the edge $xy$, wind around the cycle containing $y$, and then leave it 
at $y^{\mathrm{twin}}$. A vertex that is the midpoint of a directed path of length 2 in $\mathrm{Match}'_{BL}$ will actually 
have two twins, but we will not complicate the notation to reflect this, as it will be clear from 
the context which twin is intended. Thus we obtain two 'twin maps' $x \mapsto x^{\mathrm{twin}}$ and $y \mapsto y^{\mathrm{twin}}$. 
We also use the notation $S^{\mathrm{twin}} = \{x^{\mathrm{twin}} : x \in S\}$ when $S$ is a set of vertices. The twin maps will 
be injective on $\mathrm{Exit}_{BL}$ and on $\mathrm{Entry}_{BL}$, so that $|\mathrm{Exit}_{BL}| = |\mathrm{Exit}_{BL}^{\mathrm{twin}}|$, $|\mathrm{Entry}_{BL}| = |\mathrm{Entry}_{BL}^{\mathrm{twin}}|$, 
and moreover $|V \cap \mathrm{Exit}_{BL}| = |V^+ \cap \mathrm{Exit}_{BL}^{\mathrm{twin}}|$, $|V \cap \mathrm{Entry}_{BL}| = |V^- \cap \mathrm{Entry}_{BL}^{\mathrm{twin}}|$. 

Our choice of $x^{\mathrm{twin}}$ depends on whether the cluster $V$ containing $x$ is 2-fair with respect to 
$\mathrm{Match}_{BL}$. If $V$ is not 2-fair then we fix arbitrary perfect matchings in $G'$ from $V^-$ to $V$ and 
from $V$ to $V^+$ (using Lemma 12). Then for every $x \in V \cap \mathrm{Exit}_{BL}$ we let $x^{\mathrm{twin}}$ be the vertex $x$ 
is matched to in $V^+$ and for every $x \in V \cap \mathrm{Entry}_{BL}$ we let $x^{\mathrm{twin}}$ be the vertex in $V^-$ matched 
to $x$. 

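On the other hand, if $V$ is 2-fair then we make use of the partitions defined in Lemma 31. If 
$V \cap \mathrm{Exit}_{BL} \ne \emptyset$ then we choose twins for vertices in $V \cap \mathrm{Exit}_{BL}$ within $(V^+_{\mathrm{ent},2} \cup V^+_{\mathrm{ent},0}) \setminus V_0^*$, 
arbitrarily subject to the condition that if $|V \cap \mathrm{Exit}_{BL}| > 20\varepsilon m$ then $(V \cap \mathrm{Exit}_{BL})^{\mathrm{twin}}$ contains 
$V^+_{\mathrm{ent},0}$. (Recall that $V_0^*$ was defined in Lemma 32(v).) If $V$ is 2-fair, we will also choose 
some ideal of $(V \setminus \mathrm{Exit}_{BL}, V^+ \setminus \mathrm{Exit}_{BL}^{\mathrm{twin}})_{G'}$ to create flexibility when selecting further sets 
while preserving super-regularity. To do this, recall that $(V, V^+)_{G'}$ was $(10\varepsilon, d/4)$-super-regular. 
Together with Lemma 31(i),(ii) this implies that $(V_{\mathrm{ex},1} \setminus \mathrm{Exit}_{BL}, V^+_{\mathrm{ent},1} \setminus \mathrm{Exit}_{BL}^{\mathrm{twin}})_{G'}$ is $(30\varepsilon, d/40)$-
super-regular. Next we randomly choose sets $V_{\mathrm{ex}} \subseteq V_{\mathrm{ex},1} \setminus \mathrm{Exit}_{BL}$ and $V^+_{\mathrm{ent}} \subseteq V^+_{\mathrm{ent},1} \setminus \mathrm{Exit}_{BL}^{\mathrm{twin}}$ of 
size $80dm$. Lemma 15 (applied with $\theta = 160d$ and $n = m/2$) implies that with high probability 
$(V_{\mathrm{ex}}, V^+_{\mathrm{ent}})$ is a $(\sqrt{\varepsilon}, d^2)$-ideal for $(V_{\mathrm{ex},1} \setminus \mathrm{Exit}_{BL}, V^+_{\mathrm{ent},1} \setminus \mathrm{Exit}_{BL}^{\mathrm{twin}})_{G'}$. Moreover, Lemma 31(ii) 
and the Chernoff bound (Proposition 9) together imply that with high probability every vertex 
in $V_{\mathrm{ex},2}$ has at least $d^2m$ outneighbours in $V^+_{\mathrm{ent}}$ while every vertex in $V^+_{\mathrm{ent},2}$ has at least $d^2m$ 
inneighbours in $V_{\mathrm{ex}}$. Altogether this shows that we can choose $(V_{\mathrm{ex}}, V^+_{\mathrm{ent}})$ to be a $(\sqrt{\varepsilon}, d^2)$-ideal 
for $(V \setminus \mathrm{Exit}_{BL}, V^+ \setminus \mathrm{Exit}_{BL}^{\mathrm{twin}})_{G'}$. 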

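Similarly, if $V \cap \mathrm{Entry}_{BL} \ne \emptyset$ then we choose twins for vertices in $V \cap \mathrm{Entry}_{BL}$ within 
$(V^-_{\mathrm{ex},2} \cup V^-_{\mathrm{ex},0}) \setminus V_0^*$, arbitrarily subject to the condition that if $|V \cap \mathrm{Entry}_{BL}| > 20\varepsilon m$ then 
$(V \cap \mathrm{Entry}_{BL})^{\mathrm{twin}}$ contains $V^-_{\mathrm{ex},0}$. We also choose a $(\sqrt{\varepsilon}, d^2)$-ideal $(V^-_{\mathrm{ex}}, V_{\mathrm{ent}})$ for 
$(V^- \setminus \mathrm{Entry}_{BL}^{\mathrm{twin}}, V \setminus \mathrm{Entry}_{BL})_{G'}$. 
Then we define $X_{BL}$ to be the union of the sets $V_{\mathrm{ex}}$ and $V_{\mathrm{ent}}$ defined using $\mathrm{Match}'_{BL}$ over all 
nearly 2-fair clusters $V$. Note that these sets will play a similar role to the sets $V^*$ used in 
the highly connected case, in that they preserve super-regularity even when some vertices are 
deleted. We let 

$$X^*_{BL} := X_{BL} \cup \mathrm{Exit}_{BL} \cup \mathrm{Entry}_{BL} \cup \mathrm{Exit}_{BL}^{\mathrm{twin}} \cup \mathrm{Entry}_{BL}^{\mathrm{twin}}.$$

Note that $X^*_{BL} \cap V_0^* = \emptyset$. Define $X_{TR}$ and $X^*_{TR}$ similarly using the matching $\mathrm{Match}_{TR}$. 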

We will also choose twins for vertices in $\mathrm{Entry}_{RL}$ such that if $x \in V \in M_V^{RL}$ then $x^{\mathrm{twin}} \in 
V^- \in M_H^{RL}$. Lemma 29(iii),(iv) implies that each $x \in \mathrm{Entry}_{RL}$ has many inneighbours in $\tilde{B}$ 
while $x^{\mathrm{twin}}$ has many outneighbours in $\tilde{L}$. Writing $C$ for the cycle of $F$ containing the cluster 
containing $x$, we get a transition from $B$ to $L$ by entering $C$ at $x$ from an inneighbour in $\tilde{B}$, 
traversing $C$, then exiting $C$ at $x^{\mathrm{twin}}$ to an outneighbour in $\tilde{L}$. 

Now we describe how to choose twins for $\mathrm{Entry}_{RL}$, and also some ideals to create flexibility 
while preserving super-regularity. Call a cluster $V$ $M_V^{RL}$-full if it contains at least $\gamma m$ vertices 
in $\mathrm{Entry}_{RL}$. Say $V$ is $\ell$-good (with respect to $\mathrm{Match}_{BL}$ and $\mathrm{Entry}_{RL}$) if $V$ is $\ell$-fair with respect 
to $\mathrm{Match}_{BL}$ and no cluster with distance at most $\ell$ from $V$ on $F$ is $M_V^{RL}$-full. Since $|\mathrm{Entry}_{RL}| \le 
|V_0| \le d^{1/4}n$ the number of $M_V^{RL}$-full clusters is at most $\gamma^{-1}d^{1/4}n/m$. Recalling that by (♯) at 
most $3\gamma k$ clusters are full, 

at most $9(3\gamma k + \gamma^{-1}d^{1/4}n/m) \le 30\gamma k$ clusters are not 4-good. (0) 
Consider a cluster $V \in M_V^{RL}$ with $V \cap \mathrm{Entry}_{RL} \ne \emptyset$. If $V$ is not 2-good then we choose a perfect 
matching in $G'$ from $V^-$ to $V$ (using Lemma 12), and for each $x \in V$ let $x^{\mathrm{twin}}$ be the vertex 
in $V^-$ that is matched to $x$. Now suppose that $V$ is 2-good. Then none of $V^-$, $V$ and $V^+$ 
is full with respect to $\mathrm{Match}_{BL}$ or $M_V^{RL}$-full. Since $(V^-, V)_{G'}$ is $(10\varepsilon, d/4)$-super-regular and 
$|V \cap \mathrm{Entry}_{RL}| < \gamma m$ we can apply Lemma [HI to obtain a set $Y \subseteq V^-$ with $|Y| = |V \cap \mathrm{Entry}_{RL}|$ 
such that $(V^- \setminus Y, V \setminus \mathrm{Entry}_{RL})_{G'}$ is $(20\varepsilon, d/8)$-super-regular. Then we let the twin map be an 
arbitrary bijection from $V \cap \mathrm{Entry}_{RL}$ to $Y$. Next we apply Lemma 15 with $\theta = 32d$ to obtain a 
$(\sqrt{\varepsilon}, d^2)$-ideal for $(V^- \setminus \mathrm{Entry}_{RL}^{\mathrm{twin}}, V \setminus \mathrm{Entry}_{RL})_{G'}$, which we will call $(V^-_{\mathrm{ex}}, V_{\mathrm{ent}})$. (Similarly to 
the earlier argument and the one in the next paragraph, the partitions of $V(G) \setminus V_0$ into $\tilde{L}$, $\tilde{R}$ 
and $\tilde{M}_V$ and into $\tilde{T}$, $\tilde{B}$ and $\tilde{M}_H$ guarantee that there is no conflict with our previous notation.) 
Now we define 

$$X^*_{RL} := \bigcup_V (V^-_{\mathrm{ex}} \cup V_{\mathrm{ent}}) \cup \mathrm{Entry}_{RL} \cup \mathrm{Entry}_{RL}^{\mathrm{twin}},$$

$$X^* := X^*_{BL} \cup X^*_{RL}. \qquad (4)$$
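Then $|X^*| \le \gamma n$ and $X^* \cap V_0^* = \emptyset$. (The latter follows since the vertices in $V_0^*$ lie in 5-excellent 
clusters and so $X^*_{RL} \cap V_0^* = \emptyset$.) We also define 

$$V_{\mathrm{entry}} := V \cap (\mathrm{Entry}_{BL} \cup \mathrm{Exit}_{BL}^{\mathrm{twin}} \cup \mathrm{Entry}_{RL}),$$

$$V_{\mathrm{exit}} := V \cap (\mathrm{Exit}_{BL} \cup \mathrm{Entry}_{BL}^{\mathrm{twin}} \cup \mathrm{Entry}_{RL}^{\mathrm{twin}}).$$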

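Note that $\mathrm{Entry}_{BL} \subseteq \tilde{L}$, $\mathrm{Exit}_{BL}^{\mathrm{twin}} \subseteq \tilde{R}$ and $\mathrm{Entry}_{RL} \subseteq \tilde{M}_V^{RL}$. Since $\tilde{L}$, $\tilde{R}$ and $\tilde{M}_V$ partition 
$V(G) \setminus V_0$, any vertex will be used at most once to enter a cluster. In particular, 

$V_{\mathrm{entry}} = V \cap \mathrm{Entry}_{BL}$ if $V \in L$; 
$V_{\mathrm{entry}} = V \cap \mathrm{Exit}_{BL}^{\mathrm{twin}}$ if $V \in R$; 
$V_{\mathrm{entry}} = V \cap \mathrm{Entry}_{RL}$ if $V \in M_V^{RL}$. 

Similarly, $\mathrm{Exit}_{BL} \subseteq \tilde{B}$, $\mathrm{Entry}_{BL}^{\mathrm{twin}} \subseteq \tilde{T}$ and $\mathrm{Entry}_{RL}^{\mathrm{twin}} \subseteq \tilde{M}_H^{RL}$. Since $\tilde{T}$, $\tilde{B}$ and $\tilde{M}_H$ also partition 
$V(G) \setminus V_0$, any vertex will be used at most once to exit a cluster and 

$V_{\mathrm{exit}} = V \cap \mathrm{Exit}_{BL}$ if $V \in B$; 
$V_{\mathrm{exit}} = V \cap \mathrm{Entry}_{BL}^{\mathrm{twin}}$ if $V \in T$; 
$V_{\mathrm{exit}} = V \cap \mathrm{Entry}_{RL}^{\mathrm{twin}}$ if $V \in M_H^{RL}$. 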

Some vertices may be used for both exits and entrances, and they will have two twins. We 
summarise the properties of twins with the following lemma. 

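Lemma 33. Suppose that $|TR| > |BL|$. Then 

(i) $|\mathrm{Match}'_{BL}| + |\mathrm{Entry}_{RL}| = |TR| - |BL|$. 

(ii) Every cluster intersects at most one of $\mathrm{Entry}_{BL}$, $\mathrm{Exit}_{BL}^{\mathrm{twin}}$, $\mathrm{Entry}_{RL}$. Similarly, every 
cluster intersects at most one of $\mathrm{Exit}_{BL}$, $\mathrm{Entry}_{BL}^{\mathrm{twin}}$, $\mathrm{Entry}_{RL}^{\mathrm{twin}}$. 

(iii) There exists a perfect matching from $V \setminus V_{\mathrm{exit}}$ to $V^+ \setminus V^+_{\mathrm{entry}}$. 

(iv) Suppose $V$ is 3-good with respect to $\mathrm{Match}_{BL}$ and $\mathrm{Entry}_{RL}$. Then 

• For all sets $X'$ and $Y'$ with $(V \cap X^*) \setminus V_{\mathrm{exit}} \subseteq X' \subseteq V \setminus V_{\mathrm{exit}}$ and $(V^+ \cap X^*) \setminus V^+_{\mathrm{entry}} \subseteq 
Y' \subseteq V^+ \setminus V^+_{\mathrm{entry}}$ the pair $(X', Y')_{G'}$ is $(\sqrt{\varepsilon}, d^2)$-super-regular. 

• For all sets $X'$ and $Y'$ with $(V \cap X^*) \setminus V_{\mathrm{entry}} \subseteq X' \subseteq V \setminus V_{\mathrm{entry}}$ and $(V^- \cap X^*) \setminus V^-_{\mathrm{exit}} \subseteq 
Y' \subseteq V^- \setminus V^-_{\mathrm{exit}}$ the pair $(Y', X')_{G'}$ is $(\sqrt{\varepsilon}, d^2)$-super-regular. 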

Proof. As discussed at the beginning of the subsection. Lemma [Mli) will allow us to choose 
Match^^. and Entry ^j;^ of the size required in (i). Property (ii) was discussed above. We will 
just consider the first point of property (iv), as the second is similar. Suppose V is 3-good 
and consider sets X' and Y' as in the statement. We need to show that {X\Y')qi is {^/e,d^)- 
super-regular. If F n Entry^" / this holds by definition of Xj^^ since Vex n VexU = and 
^itnt^^entry = by (ii) (and so Vex Q X' and 1/+^ C Y'), and since y+ is 2-good. If FnExitsL / 
this holds by definition of X"^^, since V is 2-fair, and similarly, if F nEntry^2" / ^ this holds
again by definition of X"^^, since is 2-fair. It remains to prove property (iii). Suppose first 
that V G M^^. If is not 2-good then the required matching exists by the way we defined 
twins for Entry ^j;^ in this case. On the other hand, if is 2-good then we can apply the 
super-regularity property (iv) just established (which only used the fact that is 2-good) and 
Lemma [12 Next suppose that V £ B. Then Vexu = Vn Exit bl and Vg+^^j^ = V+ n Exit ^2"- 
Thus if V is not 2-fair then the required matching exists by the way we defined twins for Match^j;^ 
in this case. On the other hand, if V is 2-fair then we can apply the first point of property (iv), 
which only used the fact that V is 2-fair. Similarly, when V £ T, then VexU = V Ci Entry|^" 
and V^^^y = H Entry^j^ . If is not 2-fair the required matching exists by the construction 
in this case, whereas if is 2-fair then we can apply the second point of property (iv) with 
(y, V~^) playing the role of {V~ , V), which only uses the fact that V'^ (playing the role of V in 
the second point) is 2-fair. □ 

8.6. Summary. The auxiliary graph $H$ is decomposed into shifted components $L$ 'left' and $R$ 'right' of size $k/2 \pm 19\eta k$ and a set $M_V$ of size $|M_V| < 17\eta k$. This corresponds to a partition of $V(G) \setminus V_0 = L \cup R \cup M_V$. The 1-factor $F$ defines a partition $V(H) = T \cup B \cup M_H$, where a cluster $V$ belongs to $T$, $B$, $M_H$ if and only if its successor $V^+$ belongs to $L$, $R$, $M_V$ respectively. The shifted walk $W$ will use two types of transitions from $B$ to $L$. One type is a pseudo-matching $\mathrm{Match}_{BL}$ from $B$ to $L$, matching $\mathrm{Exit}_{BL}$ to $\mathrm{Entry}_{BL}$. The other type is a set $\mathrm{Entry}^{M}_{BL}$ of vertices in $M_V^{BL} \subseteq M_V$, with the property that if $V \in M_V^{BL}$ then any $x \in V$ has many inneighbours in $B$ and any $y \in V^-$ has many outneighbours in $L$. We did not discuss transitions from $T$ to $R$, but these are obtained similarly under the transformation $L \leftrightarrow R$, $B \leftrightarrow T$, etc. Each vertex in these sets has a twin (or possibly two twins) that will be used when $W$ traverses the cycle of $F$ containing it. For any cluster $V$, the set of exit points from $V$ is $V_{exit}$ and the set of entry points to $V$ is $V_{entry}$. There exists a perfect matching from $V \setminus V_{exit}$ to $V^+ \setminus V^+_{entry}$. The exceptional set $V_0$ is decomposed into 4 parts $TR$, $TL$, $BR$ and $BL$, where the first letter gives the in-type of a vertex and the second letter the out-type: there is a matching $\mathrm{Match}_T$ from $T^*$ to vertices of in-type $T$ (and so on). Technical complications are created by the possibility that a cluster may be full (contain at least $\gamma m$ endpoints of $\mathrm{Match}_{BL}$) or $M_{BL}$-full (contain at least $\gamma m$ endpoints of $\mathrm{Entry}^{M}_{BL}$). A cluster $V$ is $\ell$-fair if no cluster at distance at most $\ell$ from $V$ is full, $\ell$-excellent if no cluster at distance at most $\ell$ from $V$ is full or in $M = M_V \cup M_H$, and $\ell$-good if no cluster at distance at most $\ell$ from $V$ is full or $M_{BL}$-full. We have a set $X^*$, a union of sets defined earlier, such that whenever $V$ is 3-good, we have flexibility to use any vertices avoiding these sets in $V^-$, $V$ and $V^+$ (as well as avoiding the exits and entries already chosen), while preserving super-regularity of the corresponding pairs in $F$. Finally, the set $V_0^*$ of endpoints in $V(G) \setminus V_0$ of the matchings $\mathrm{Match}_T$ etc. only uses 5-excellent clusters and avoids $X^*$.

9. The size of the pseudo-matching 

Our aim in this section is to prove the following lower bound on the size of our pseudo-matchings $\mathrm{Match}_{BL}$ and $\mathrm{Match}_{TR}$.

Lemma 34. 

(i) iMatchsLl ^ min{|M^^|/2,7'^n} - |My^| - |Vb|. Moreover, if \TR\ > \BL\ then 
iMatchsil ^ |Ti?| - \BL\ - |M^^| + min{|M^^|/2, 7''n}. 

(ii) |MatchTi?| ^ min{|M^^|/2, 7^71} - |M^^| - |Vb|- Moreover, if \BL\ > \TR\ then 
iMatchTfil > \BL\ - \TR\ - \M^^\ + min{|M^^|/2, 7'in}. 

To prove this we first show that there are large sets $S_B \subseteq B$ with many outneighbours in $L$
and $S_L \subseteq L$ with many inneighbours in $B$. Note that part (v) in the following lemma is not
used in the proof of Lemma 34 but will be needed in the final section of the paper.
Lemma 35.

(i) If $|TR| > |BL|$ there is $S_B \subseteq B$ with $|S_B| \ge \beta n/100$, such that every $x \in S_B$ satisfies

$|N^+(x) \cap L| \ge \deg_L := \frac{n}{2} - (|BL| + |BR| + |R|) - |M_V^{BL}| - |M_V^{TR}|/4$.

Furthermore, in any case, $B$ contains a set $S_B^*$ of size $|S_B^*| \ge \beta n/100$, such that every $x \in S_B^*$ satisfies $|N^+(x) \cap L| \ge \frac{n}{2} - |V_0| - |R| - |M_V^{BL}| - |M_V^{TR}|/4$.

(ii) If $|TR| > |BL|$ there is $S_L \subseteq L$ with $|S_L| \ge \beta n/100$, such that every $x \in S_L$ satisfies

$|N^-(x) \cap B| \ge \deg_B := \frac{n}{2} - (|TL| + |BL| + |T|) - |M_H^{BL}| - |M_H^{TR}|/4$.

Furthermore, in any case, $L$ contains a set $S_L^*$ of size $|S_L^*| \ge \beta n/100$, such that every $x \in S_L^*$ satisfies $|N^-(x) \cap B| \ge \frac{n}{2} - |V_0| - |T| - |M_H^{BL}| - |M_H^{TR}|/4$.

(iii) If $|BL| > |TR|$ there is $S_T \subseteq T$ with $|S_T| \ge \beta n/100$, such that every $x \in S_T$ satisfies

$|N^+(x) \cap R| \ge \deg_R := \frac{n}{2} - (|TL| + |TR| + |L|) - |M_V^{TR}| - |M_V^{BL}|/4$.

Furthermore, in any case, $T$ contains a set $S_T^*$ of size $|S_T^*| \ge \beta n/100$, such that every $x \in S_T^*$ satisfies $|N^+(x) \cap R| \ge \frac{n}{2} - |V_0| - |L| - |M_V^{TR}| - |M_V^{BL}|/4$.

(iv) If $|BL| > |TR|$ there is $S_R \subseteq R$ with $|S_R| \ge \beta n/100$, such that every $x \in S_R$ satisfies

$|N^-(x) \cap T| \ge \deg_T := \frac{n}{2} - (|TR| + |BR| + |B|) - |M_H^{TR}| - |M_H^{BL}|/4$.

Furthermore, in any case, $R$ contains a set $S_R^*$ of size $|S_R^*| \ge \beta n/100$, such that every $x \in S_R^*$ satisfies $|N^-(x) \cap T| \ge \frac{n}{2} - |V_0| - |B| - |M_H^{TR}| - |M_H^{BL}|/4$.

(v) Finally, suppose that $M_V^{BL}$, $TR$ and $BL$ are all empty.

• If $|L \cup TL| \ge |B \cup BR|$, then $B$ contains a set $S_B$ of at least $\beta n/100$ vertices, each having at least $|M_V^{TR}|/4$ outneighbours in $L \cup TL$.

• If $|L \cup TL| \le |B \cup BR|$, then $L$ contains a set $S_L$ of at least $\beta n/100$ vertices, each having at least $|M_V^{TR}|/4$ inneighbours in $B \cup BR$.

Proof. Suppose that $|TR| > |BL|$. To prove (i), we first consider the case when $d^+_{(1-\beta)n/2} \ge n/2$. Let $S'_B$ be as defined in Lemma 31(v). Let $S_B$ be the set obtained from $S'_B$ by deleting the following vertices.

• The set of $2|V_0| \le 2d^{1/2}n$ endvertices in $V(G) \setminus V_0$ of edges in $\mathrm{Match}_T$, $\mathrm{Match}_B$, $\mathrm{Match}_L$, $\mathrm{Match}_R$.

• All the at most $500\eta n$ vertices which lie in clusters that are not 5-excellent with respect to $\mathrm{Match}_{BL}$ or $\mathrm{Match}_{TR}$.

• All the at most 2| Vo| A;/(7m/2) ^ d}^^n vertices which lie in clusters containing at least 
7m/2 vertices of Vq . 

• All the at most 12rj'n vertices in B having more than \My^\/4: outneighbours in My^ 
(see Lemma [29^11) ) ■ 

Thus $|S_B| \ge \beta n/100$. Now we make the following key use of the minimality of $|TR| - |BL| > 0$. We claim that any vertex $x \in S_B$ has outdegree at most $|BL| + |BR|$ in $V_0$. Otherwise, there would be some edge $xy$ with $y \in TL \cup TR$. But then we can change the in-type of $y$ to $B$ by deleting the edge in $\mathrm{Match}_T$ incident to $y$ and adding the edge $xy$ to $\mathrm{Match}_B$. Conditions (v) and (vi) in Lemma 32 will still hold, since $S_B$ is disjoint from $V_0^*$ and only contains vertices in clusters containing at most $\gamma m/2$ vertices of $V_0^*$. Condition (ii) holds since $x \in B^*$ by definition of $S_B$. This reduces $\bigl||TR| - |BL|\bigr|$, which contradicts the minimality condition in Lemma 32(vii). Therefore the claim holds. Now recall that $R \cup L \cup M_V^{BL} \cup M_V^{TR} \cup V_0$ is a partition of $V(G)$. Any $x \in S_B$ has at least $n/2$ outneighbours, of which at most $|R| + |M_V^{BL}|$ belong to $R \cup M_V^{BL}$, at most $|M_V^{TR}|/4$ belong to $M_V^{TR}$ and at most $|BL| + |BR|$ belong to $V_0$. This shows that $S_B$ is a set as required in (i).

Now consider the case when d.^i_i^)n/2^^^ ^ '^Z^' ^^'^ ^(i-/3)n/2(^) ^ (1 + /5)re/2 by our 
degree assumptions. Then G has at least (1 + P)n/2 vertices of indegree at least (1 + /3)n/2, 
and by Lemma [27] at least (1 + l5)n/2 - \R\ - |Vb| - Wv\ > /3n/5 of these belong to L. Let 
A C L be a set of /3n/3 vertices with indegree at least (1 + f3)n/2. Note that every vertex 
in A has indegree at least (1 + P)n/2 + \B\ — n > j3n/'i in B. Then we must have a set Sb 
of at least /3n/100 vertices in B, each having outdegree at least j3^n in A, or we would have 
/3n/3-|A| ^E{B,A) ^ I3n/im ■\A\+ l3^n\B\, a contradiction. Then every vertex in Sb has at 
least /3^n ^ ^ — outneighbours in A C L, as required. This completes the proof of (i) when 
\TR\ > \BL\. 

The argument for (i) when we do not have $|TR| > |BL|$ is the same, except that we no longer have the minimality argument for $\bigl||TR| - |BL|\bigr|$, so vertices in the resulting set may have all of $V_0$ as outneighbours. The arguments for (ii)-(iv) are analogous, so we omit them.

Finally, suppose that $M_V^{BL}$, $TR$ and $BL$ are all empty, so that $R \cup L \cup M_V^{TR} \cup TL \cup BR$ is a partition of $V(G)$. For the first point in (v), suppose that $|L \cup TL| \ge |B \cup BR|$. Since $|B| = |R|$, every vertex $x$ in $S_B$ (defined as above) has at least $n/2 - |BR| - |B| - |M_V^{TR}|/4$ outneighbours in $L \cup TL$. By assumption, we have $|BR| + |B| + |M_V^{TR}|/2 \le n/2$, so the number of outneighbours of $x$ in $L \cup TL$ is at least $|M_V^{TR}|/4$, as required. The second point follows in the same way. □

Proof of Lemma \ 34\ Observe that all stated lower bounds are at most 7^n, so it is enough 
to prove the existence of pseudo-matchings satisfying these bounds. We will suppose that 
\TR\ > \BL\ and prove the 'moreover' statement of (i); the arguments for the other assertions 
are similar. Define an auxiliary bipartite graph whose vertex classes are $B$ and $L$ by joining a
vertex $x \in B$ to a vertex $y \in L$ if $xy$ is an edge of $G$. Let $J$ be the graph obtained from this
bipartite graph by deleting all the edges whose endvertices both lie in clusters having distance
at most 4 on $F$ from $M$. Let $Q$ be the largest matching in $J$.

Case 1. $|Q| \ge \gamma^2 n$.

Let $Q'$ be a matching obtained from $Q$ by deleting as few edges as possible so as to ensure that every vertex of $G$ belongs to at most one edge from $Q'$. Note that every vertex of $G$ has indegree at most 1 and outdegree at most 1 in $Q$, so $Q$ considered as a subdigraph of $G$ is a vertex-disjoint union of directed paths and cycles. Thus we can retain at least 1/3 of the edges of $Q$ in $Q'$ (with equality for a disjoint union of directed triangles). By deleting further edges if necessary we may assume that $|Q'| = \gamma^2 n/3$.
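
The thinning step from $Q$ to $Q'$ can be illustrated by a short sketch. It assumes the input is a list of arcs in which every vertex has indegree and outdegree at most 1 and there are no loops; the function name `thin_to_disjoint_edges` is hypothetical and not from the paper. It keeps every other arc along each path and cycle, which retains at least one third of the arcs.

```python
def thin_to_disjoint_edges(edges):
    """Keep a subset of arcs so that every vertex lies in at most one kept arc.

    Assumes (as for Q in the text) that every vertex has indegree <= 1 and
    outdegree <= 1, and that there are no loops.
    """
    succ = dict(edges)
    heads = {v for _, v in edges}
    assert len(succ) == len(edges) and len(heads) == len(edges)

    kept, seen = [], set()
    # Directed paths: start from vertices with no incoming arc, keep arcs 0, 2, 4, ...
    for start in [u for u in succ if u not in heads]:
        u, take = start, True
        while u in succ:
            seen.add(u)
            if take:
                kept.append((u, succ[u]))
            take = not take
            u = succ[u]
    # Directed cycles: everything not yet seen. Keep arcs 0, 2, ... but never
    # the last arc, which would share a vertex with arc 0.
    for start in succ:
        if start in seen:
            continue
        cycle, u = [], start
        while u not in seen:
            seen.add(u)
            cycle.append((u, succ[u]))
            u = succ[u]
        kept.extend(cycle[i] for i in range(0, len(cycle) - 1, 2))
    return kept
```

A directed triangle yields exactly one kept arc out of three, matching the extremal case mentioned in the text.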

We claim that there is a submatching $Q''$ of $Q'$ of size at least $\gamma|Q'|/3$ such that no cluster is full with respect to $Q''$ (i.e. every cluster contains at most $\gamma m$ endvertices of $Q''$). To see that such a $Q''$ exists, consider the submatching $Q''$ obtained from $Q'$ by retaining every edge of $Q'$ with probability $\gamma/2$ in $Q''$, independently of all other edges of $Q'$. Then for any cluster $V$, the expected number of endvertices of $Q''$ in $V$ is at most $\gamma m/2$, and the expected size of $Q''$ is $\gamma|Q'|/2$. By Chernoff bounds we see that with high probability $Q''$ has the claimed properties.

Note that Q" is a pseudo-matching from B to L, as by construction it is a matching, and by 
definition of J every edge in Q" has an endvertex in a 4-excellent cluster. Also, since | Vol ^ d^/^n 
we have \Q"\ ^ ^ | Vb| + 'y^n ^ \TR\ - \BL\ - \M^^\ + 7^n, as required. 

Case 2. $|Q| < \gamma^2 n$.

Let $A$ be a minimum vertex cover of $J$. Then $|A| \le \gamma^2 n$ by König's theorem (Proposition 6). Write $A_B := A \cap B$ and $A_L := A \cap L$. We say that a cluster $V$ is $A$-full if it contains at least $\gamma m/3$ vertices from $A$. We say that $V$ is $A$-excellent if no cluster of distance at most 4 from $V$ on $F$ is $A$-full or lies in $M$. Note that at most $\gamma^2 n/(\gamma m/3) = 3\gamma n/m$ clusters are $A$-full and thus by (2) all but at most $9(3\gamma n + |M|m) \le 350\eta n$ vertices lie in $A$-excellent clusters. Since $|TR| > |BL|$ we can construct the sets $S_B$ and $S_L$ given by Lemma 35(i) and (ii). Let $S'_B$ be the set of all those vertices in $S_B \setminus A$ which lie in $A$-excellent clusters. Thus $|S'_B| \ge |S_B| - \gamma^2 n - 350\eta n \ge \beta n/101$. Moreover, $N^+(S'_B) \cap L \subseteq A_L$, since none of the edges deleted in the construction of $J$ were incident to $S'_B$.
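
König's theorem, used above to bound $|A|$ by $|Q|$, can be checked on small examples with the following sketch. It is a standard augmenting-path construction, not code from the paper; the function names and the adjacency-dictionary input format are assumptions made for illustration.

```python
def max_matching(adj, left):
    """Maximum matching in a bipartite graph; returns a dict right -> left."""
    match_r = {}

    def try_augment(u, visited):
        for v in adj.get(u, ()):
            if v in visited:
                continue
            visited.add(v)
            if v not in match_r or try_augment(match_r[v], visited):
                match_r[v] = u
                return True
        return False

    for u in left:
        try_augment(u, set())
    return match_r


def min_vertex_cover(adj, left):
    """Minimum vertex cover via Konig's construction; returns (matching, cover)."""
    match_r = max_matching(adj, left)
    match_l = {u: v for v, u in match_r.items()}
    # Alternating search starting from the unmatched left vertices.
    reach_l = {u for u in left if u not in match_l}
    reach_r = set()
    stack = list(reach_l)
    while stack:
        u = stack.pop()
        for v in adj.get(u, ()):
            if v not in reach_r:
                reach_r.add(v)
                w = match_r.get(v)
                if w is not None and w not in reach_l:
                    reach_l.add(w)
                    stack.append(w)
    # Cover = unreached left vertices together with reached right vertices.
    cover = (set(left) - reach_l) | reach_r
    return match_r, cover
```

In the setting of Case 2 the left class plays the role of $B$, the right class the role of $L$, and the returned cover corresponds to $A = A_B \cup A_L$.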

Now we greedily choose a matching $\mathrm{Match}_1$ from $S'_B$ to $A_L \subseteq L$ of size $\deg_L$ (defined in
Lemma 35(i)) in such a way that every cluster contains at most $\gamma m/3$ vertices on the $B$-side
of J. To see that this is possible, note that at any stage in the process we have excluded at 
most 1^1/(7771/3) < /3n/101 ^ vertices in S'^, so we can always pick a suitable vertex x 
in S'^. Then, since we have chosen less than deg^ vertices in Al, we can choose an unused 
outneighbour of x in L (which lies in the cover A, so in Al). 

Let $S'_L$ be the set of all those vertices in $S_L \setminus A$ which lie in $A$-excellent clusters and are not endvertices of edges in $\mathrm{Match}_1$. Then $|S'_L| \ge |S_L| - \gamma^2 n - 350\eta n - 2|\mathrm{Match}_1| \ge \beta n/101$ and $N^-(S'_L) \cap B \subseteq A_B$. As before, we can greedily choose a matching $\mathrm{Match}_2$ from $A_B \subseteq B$ to $S'_L$ of size $\deg_B$ in such a way that every cluster contains at most $\gamma m/3$ vertices on the $L$-side of $J$.

Note that every $A$-excellent cluster is 4-excellent with respect to $\mathrm{Match}_1 \cup \mathrm{Match}_2$, as it contains at most $\gamma m$ endvertices of edges from $\mathrm{Match}_1 \cup \mathrm{Match}_2$ (it is not $A$-full), and so is not full with respect to $\mathrm{Match}_1 \cup \mathrm{Match}_2$. Also, any edge $e$ in $\mathrm{Match}_1 \cup \mathrm{Match}_2$ has one endvertex in $A$ and one endvertex outside $A$. The endvertex outside $A$ is that in $S'_B$ (if $e \in \mathrm{Match}_1$) or $S'_L$ (if $e \in \mathrm{Match}_2$). So the endvertex outside $A$ is not an endvertex of another edge from $\mathrm{Match}_1 \cup \mathrm{Match}_2$ and lies in a cluster which is 4-excellent with respect to $\mathrm{Match}_1 \cup \mathrm{Match}_2$. We deduce that $\mathrm{Match}_1 \cup \mathrm{Match}_2$ is a disjoint union of edges and directed paths of length 2 satisfying the definition of a pseudo-matching from $B$ to $L$. Moreover, since $|T| = |L|$ and $|M_V^{BL}| = |M_H^{BL}|$ (and likewise for $TR$), we have

$|\mathrm{Match}_1 \cup \mathrm{Match}_2| = \deg_L + \deg_B$

$= n - (2|BL| + |BR| + |TL| + |R| + |T|) - 2|M_V^{BL}| - 2|M_V^{TR}|/4$

$= |TR| - |BL| + (n - |V_0| - |R| - |L|) - 2|M_V^{BL}| - |M_V^{TR}|/2$

$= |TR| - |BL| + |M_V| - 2|M_V^{BL}| - |M_V^{TR}|/2$

$= |TR| - |BL| - |M_V^{BL}| + |M_V^{TR}|/2$,

as required. The proof of the first statement of (i) is the same, except that we use the 'furthermore' statements of Lemma 35(i) and (ii) in the final calculation instead of working with $\deg_L$ and $\deg_B$. □
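
The final calculation in Case 2 is pure bookkeeping, and it can be sanity-checked numerically. The sketch below assumes the partition $n = |V_0| + |L| + |R| + |M_V|$ with $|M_V| = |M_V^{BL}| + |M_V^{TR}|$, together with $|T| = |L|$ and equal sizes for the corresponding parts of $M_V$ and $M_H$; the function and parameter names are illustrative only.

```python
from fractions import Fraction


def deg_sum(TL, TR, BL, BR, L, R, M_bl, M_tr):
    """deg_L + deg_B, computed from the definitions in Lemma 35(i) and (ii)."""
    n = TL + TR + BL + BR + L + R + M_bl + M_tr  # V_0, L, R, M_V partition V(G)
    T = L                                        # assumption: |T| = |L|
    deg_L = Fraction(n, 2) - (BL + BR + R) - M_bl - Fraction(M_tr, 4)
    deg_B = Fraction(n, 2) - (TL + BL + T) - M_bl - Fraction(M_tr, 4)
    return deg_L + deg_B


def closed_form(TR, BL, M_bl, M_tr):
    """Right-hand side |TR| - |BL| - |M^BL| + |M^TR|/2 of the displayed identity."""
    return TR - BL - M_bl + Fraction(M_tr, 2)
```

The two functions agree for every choice of nonnegative integer inputs, since the identity is linear in all parameters.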

10. Proof of Theorem O 

In this section we use the matchings and sets constructed in Section [S] to prove Theorem O 
We will assume that $H$ is not strongly $\eta k$-connected, as we have already covered this case in Section 7 (although it could also be deduced from the arguments in this section). Our strategy will depend on the value of $|TR| - |BL|$, and also on the size of the middle, as described by the cases ($\star$) or ($\star\star$) above. We divide the proof into three subsections: the first covers the case when $|TR| \neq |BL|$ and ($\star$) holds, the second when ($\star\star$) holds, and the third when $|TR| = |BL|$ and ($\star$) holds.

10.1. The case when $|TR| \neq |BL|$ and ($\star$) holds. We will just give the argument for the
case when $|TR| > |BL|$, as the other case is similar. We recall that ($\star$) is the case when
\M\ ^ 1^01/7"^ and |C fl M\ < |C|/10 for every cycle C of F. In this case we will use the 
pseudo-matching $\mathrm{Match}_{BL}$ as well as the additional transitions from $B$ to $L$ which we get from
$\mathrm{Entry}^{M}_{BL} \cup \mathrm{Entry}^{M,\mathrm{twin}}_{BL}$. We want to construct a walk $W$ with the same properties as in the proof
of the case when $H$ is highly connected.
Recall that both $H[L]$ and $H[R]$ are strongly $\eta' k/2$-connected (see Lemma 27). Then by arguing as in the proof of Lemma 22 for the graph $H[L]$ instead of $H$ we deduce that for any two clusters $V, V' \in L$ we can find $\eta'^2 k/64$ shifted walks (with respect to $R_{G''}$ and $F$) from $V$ to $V'$ such that each walk traverses at most $4/\eta'$ cycles from $F$ and every cluster is internally used by at most one of these walks. A similar statement holds for any two clusters in $R$. For any two clusters $V$ and $V'$, we call a shifted walk (with respect to $R_{G''}$ and $F$) from $V$ to $V'$ useful if it traverses at most $4/\eta'$ cycles from $F$ and if every cluster which is internally used by the walk is 4-excellent (note that 4-excellence is defined with respect to the pseudo-matching $\mathrm{Match}_{BL}$ in this case, and the pseudo-matching $\mathrm{Match}_{TR}$ is irrelevant). Since all but at most $400\eta k$ clusters are 4-excellent (by ($\clubsuit$)) we have the following property.

($\heartsuit$) Whenever $\mathcal{V}$ is a set of at most $\eta'^2 k/100$ clusters and $V, V' \in L$ there exists a useful shifted walk from $V$ to $V'$ that does not internally use clusters in $\mathcal{V}$. A similar statement holds for any two clusters in $R$.

We incorporate the vertices of the exceptional set $V_0$ using whichever edges in the matchings $\mathrm{Match}_T$, $\mathrm{Match}_B$, $\mathrm{Match}_L$, $\mathrm{Match}_R$ correspond to their in-types and out-types. Suppose for example that we have just visited a vertex $x$ of out-type $L$, arriving via some edge in $\mathrm{Match}_T$ or $\mathrm{Match}_B$ (depending on the in-type of $x$) and leaving to its outneighbour $x^+$ in the matching $\mathrm{Match}_L$. Then $x^+$ belongs to some cluster $U$ in $L$. Since $H[L]$ is highly connected we can proceed to incorporate any vertex $y$ of in-type $T$ as follows. Let $y^-$ be the inneighbour of $y$ in $\mathrm{Match}_T$. Then $y^-$ belongs to some cluster $V$ of $T$, and so by definition of $T$ the successor $V^+$ of $V$ on $F$ is a cluster of $L$. Let $W'_{xy} = X_1 C_1 X_1^- X_2 C_2 X_2^- \dots X_t C_t X_t^- X_{t+1}$ be a useful shifted walk with $X_1 = U$ and $X_{t+1} = V^+$. Let $C_{t+1}$ be the cycle of $F$ containing $X_{t+1} = V^+$ and form $W_{xy}$ by appending the path in $C_{t+1}$ from $V^+$ to $V$. So for any cycle $C$ of $F$ the clusters of $C$ are visited equally often by $W_{xy}$. Then in the construction of $W$ we can use the walk $W_{xy}$ to move from $x$ to $y$. Note that since we chose $\mathrm{Match}_T$, $\mathrm{Match}_B$, $\mathrm{Match}_L$, $\mathrm{Match}_R$ to use at most $\gamma m$ vertices from any cluster and since $|V_0|/(\gamma m) \ll \eta'^2 k$, ($\heartsuit$) implies that we can avoid using any cluster more than $3\gamma m$ times (although we may visit a cluster more often).

Thus we see that the structure of $H$ allows us to follow any vertex of out-type $L$ with any vertex of in-type $T$, and similarly we can follow any vertex of out-type $R$ with any vertex of in-type $B$. In particular, we can incorporate all vertices of type $TL$ sequentially, all vertices of type $BR$ sequentially, and vertices of type $BL$ or $TR$ can be incorporated in an alternating sequence, while there remain vertices of both types. This explains the purpose of condition (vii) in Lemma 32, choosing $\bigl||TR| - |BL|\bigr|$ to be minimal.

We order the vertices of $V_0$ as follows. First, we list all vertices of type $TL$ (if any exist). These will be followed by an arbitrary vertex of type $TR$ (which must exist as $|TR| > |BL| \ge 0$). Then list all vertices of type $BR$ (if any exist). Then we alternately list vertices of type $BL$ and $TR$ until all vertices of type $BL$ are exhausted. Finally, we list all vertices of type $TR$ (if any remain). So the list by type has the form:

$TL, \dots, TL \mid TR \mid BR, \dots, BR \mid BL, TR, \dots, BL, TR \mid TR, \dots, TR$.
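
The ordering just described is easy to make concrete. The sketch below is only an illustration: the function name and the representation of an exceptional vertex as a `(name, type)` pair are assumptions, not notation from the paper. It requires more vertices of type TR than of type BL, as in the case under consideration.

```python
def order_exceptional(vertices):
    """Order (name, type) pairs as TL.. | TR | BR.. | BL,TR,.. | TR..

    Each type is one of 'TL', 'TR', 'BL', 'BR' (in-type first, out-type second).
    """
    by_type = {t: [v for v in vertices if v[1] == t] for t in ("TL", "TR", "BL", "BR")}
    tr, bl = by_type["TR"], by_type["BL"]
    if len(tr) <= len(bl):
        raise ValueError("this ordering needs more TR vertices than BL vertices")

    order = list(by_type["TL"])      # all TL vertices
    order.append(tr[0])              # one arbitrary TR vertex
    order.extend(by_type["BR"])      # all BR vertices
    used_tr = 1
    for b in bl:                     # alternate BL, TR until BL is exhausted
        order.append(b)
        order.append(tr[used_tr])
        used_tr += 1
    order.extend(tr[used_tr:])       # final block of |TR| - |BL| - 1 TR vertices
    return order
```

Before the final block, every consecutive pair has out-type $L$ followed by in-type $T$, or out-type $R$ followed by in-type $B$, which are exactly the moves available inside $H[L]$ and $H[R]$.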

We can follow the procedure described above to incorporate all vertices in the list apart from the final block of $|TR| - |BL| - 1$ vertices of type $TR$. At this point the above procedure would require a shifted walk from $R$ to $L$, which need not exist. For these remaining vertices we will use the $|TR| - |BL|$ transitions from $B$ to $L$ formed by the matching $\mathrm{Match}_{BL}$ and the vertices in $\mathrm{Entry}^{M}_{BL} \cup \mathrm{Entry}^{M,\mathrm{twin}}_{BL}$. (We need $|TR| - |BL|$ transitions rather than $|TR| - |BL| - 1$ since we need to close the walk $W$ after incorporating the last exceptional vertex.) Suppose we have just visited an exceptional vertex $a$ of type $TR$, leaving to its outneighbour $a^+$ in the matching $\mathrm{Match}_R$, and we want to visit another vertex $b$ of type $TR$, with inneighbour $b^-$ in the matching $\mathrm{Match}_T$. Let $U$ be the cluster of $R$ containing $a^+$ and $V$ the cluster of $T$ containing $b^-$. We pick an unused edge $xy$ of $\mathrm{Match}_{BL}$, where $x$ belongs to a cluster $X$ of $B$ and $y$ to a cluster $Y$ of $L$. Recall from Subsection 8.3 that $x \in \mathrm{Exit}_{BL}$ and $y \in \mathrm{Entry}_{BL}$ have twins $x^{\mathrm{twin}} \in X^+ \in R$ and $y^{\mathrm{twin}} \in Y^- \in T$. By Lemma 30 there are $X' \in B$ and $Y' \in L$ such that $x^{\mathrm{twin}}$ has at least $d'm/4$ inneighbours in $X'$, whereas $y^{\mathrm{twin}}$ has at least $d'm/4$ outneighbours in $Y'$. We can also choose $X'$ and $Y'$ to be 4-excellent, since by Lemma 30 there are at least $\eta' k/4$ choices for both $X'$ and $Y'$, and at most $400\eta k \ll \eta' k/4$ clusters are not 4-excellent (by ($\clubsuit$)). Choose a useful shifted walk $W_1$ from $U$ to the $F$-successor $(X')^+$ of $X'$ and a useful shifted walk $W_2$ from $Y'$ to the $F$-successor $V^+$ of $V$. $W_1$ and $W_2$ exist by ($\heartsuit$), since $(X')^+ \in R$ as $X' \in B$ and $V^+ \in L$ as $V \in T$. Thus, as illustrated in Figure 3, we can form a segment of the walk $W$ linking $a$ to $b$ by first following $W_1$ to $(X')^+$, then the path in $F$ from $(X')^+$ to $X'$, then the edge $X'X^+$, then the path in $F$ from $X^+$ to $X$, then the edge $xy$, then the path in $F$ from $Y$ to $Y^-$, then the edge $Y^-Y'$, then $W_2$ to $V^+$, and finally the path in $F$ from $V^+$ to $V$. When we are transforming our walk $W$ into a Hamilton cycle we will replace $X'X^+$ with an edge of $G$ from some vertex in $X'$ to $x^{\mathrm{twin}}$ and replace $Y^-Y'$ with an edge from $y^{\mathrm{twin}}$ to some vertex in $Y'$. So we say that $x^{\mathrm{twin}}$ is a prescribed endvertex for this particular occurrence of $X'X^+$ on $W$ and that $y^{\mathrm{twin}}$ is a prescribed endvertex for this particular occurrence of $Y^-Y'$ on $W$. The vertices $x$ and $y$ will be prescribed endvertices for the edge $xy$ on $W$. (We will also define other prescribed endvertices on $W$, and if they are not endpoints of $\mathrm{Match}_{BL}$ then they will always be such that they have at least $d'm/4$ inneighbours in the previous cluster on $W$ or at least $d'm/4$ outneighbours in the next cluster on $W$. Note our eventual Hamilton cycle may not follow the route connecting $x^{\mathrm{twin}}$, $x$, $y$ and $y^{\mathrm{twin}}$ used here since we will use a rerouting procedure which is similar to that in the case when $H$ is highly connected.)

Figure 3. Transitions using $\mathrm{Match}_{BL}$.

We use different matching edges from $\mathrm{Match}_{BL}$ for different vertices of type $TR$. After having used all of $\mathrm{Match}_{BL}$, we use the $|TR| - |BL| - |\mathrm{Match}_{BL}|$ transitions from $B$ to $L$ which we get from $\mathrm{Entry}^{M}_{BL} \cup \mathrm{Entry}^{M,\mathrm{twin}}_{BL}$ instead. If $X \in M_V^{BL}$ is the cluster containing $x \in \mathrm{Entry}^{M}_{BL}$ and so $X^- \in M_H^{BL}$ is the cluster containing its twin $x^{\mathrm{twin}} \in \mathrm{Entry}^{M,\mathrm{twin}}_{BL}$, then, using Lemma 30 again, we can choose 4-excellent clusters $X' \in B$ and $X'' \in L$ such that $x$ has at least $d'm/4$ inneighbours in $X'$ whereas $x^{\mathrm{twin}}$ has at least $d'm/4$ outneighbours in $X''$. We then take $W_1$ to be a useful walk from $U$ to the $F$-successor $(X')^+$ of $X'$ and $W_2$ to be a useful walk from $X''$ to $V^+$. Then when we are transforming $W$ into a Hamilton cycle we will replace $X'X$ with an edge of $G$ from some vertex in $X'$ to $x$ and replace $X^-X''$ with an edge from $x^{\mathrm{twin}}$ to some vertex in $X''$. So we say that $x$ is a prescribed endvertex for this particular occurrence of $X'X$ on $W$ and that $x^{\mathrm{twin}}$ is a prescribed endvertex for this particular occurrence of $X^-X''$ on $W$.

At the moment we have constructed a walk $W$ which starts in the cluster $U^* \in T$ containing the inneighbour of the first exceptional vertex in our list, then goes into that vertex and then joins up all the exceptional vertices. After visiting the last exceptional vertex of type $TR$, $W$ follows our last transition from $B$ to $L$ and ends in some 4-excellent cluster $V^* \in L$. (Using the same notation as above, if this last transition was a matching edge $xy \in \mathrm{Match}_{BL}$ then $V^* = Y'$, and if it was a transition formed by a vertex $x \in \mathrm{Entry}^{M}_{BL}$ and its twin $x^{\mathrm{twin}}$ then $V^* = X''$.) Say that a cluster $V$ is nearly 4-good if $V$ is either 4-good or at distance 1 on $F$ from a 4-good cluster. (A nearly 4-good cluster is 3-good, but not conversely.) Note that since all walks above were useful, the walk $W$ constructed only uses nearly 4-good clusters, except when it uses a prescribed endvertex. Using ($\heartsuit$), it is easy to check that we can choose $W$ in such a way that every nearly 4-good cluster is used at most $9\gamma m$ times.

Before closing up the walk $W$, we have to enlarge it by some special walks $W_L^{bad}$, $W_L^{good}$, $W_R^{bad}$ and $W_R^{good}$ which will ensure that we can actually transform $W$ into a Hamilton cycle of $G$ (rather than a 1-factor). We start by defining $W_L^{good}$. List the 4-good clusters in $L$ as $V_1, \dots, V_s$ for some $s$, where $V_1 = V^*$. Choose useful shifted walks $W_i$ from $V_i$ to $V_{i+1}$, for $i = 1, \dots, s$, where $V_{s+1} := V^*$. Let $W_L^{good} := W_1 \dots W_s$. Then $W_L^{good}$ is a shifted walk from $V^*$ to itself, which uses every 4-good cluster in $L$ at least once, and which only uses nearly 4-good clusters.

Call a cycle in $F$ bad if it does not contain a 4-good cluster lying in $L \cup R$. For every bad cycle we pick a cluster whose distance from $M$ on $F$ is at least 2. (This is possible since we are in case ($\star$), when no $F$-cycle significantly intersects $M$.) We let $Z_L$ and $Z_R$ be the sets of clusters in $L$ and $R$ thus obtained. Then no cluster $Z \in Z_L \cup Z_R$ is nearly 4-good, since $Z$ has distance at least 2 from $M$ on $F$ and so the neighbours of $Z$ on $F$ cannot be 4-good by definition of 'bad'. In particular, no cluster in $Z_L \cup Z_R$ is 4-good, so $|Z_L|, |Z_R| \le 30\gamma k$ by ($\diamondsuit$).

The purpose of the walk Wf^'^ is to 'fill up' each cluster in Z^: W^'^ will ensure that W enters 
each such cluster precisely m times. (Recall that this notion was defined in Section [5l) List the 
clusters in Zl that are not already entered m times as Z^, . . . , Z*, and let Oj := m — |Zg„^^y|, 
where Z\^i^.y is as defined before the statement of Lemma [33l (Recall that is not nearly 4- 
good and so W only enters in vertices which are prescribed.) Let [/* € T be the -F-predecessor 
of Z*. Let z^, . . . , z*^ be the vertices in \ Z\^^^y and n^, . . . , u\. be the vertices in [/* \ V^^n- 
(Lemma [33lf iii) implies that \Z\^^^y\ = \Ul^^f\.) Apply Lemma [30] to choose 4-excellent clusters 
Zj € T and C/j G L such that Zj has at least d'm/A inneighbours in Zj and such that n*- has at 
least d'm/A outneighbours in f/j. We now find the following shifted walks: 

• For each i = 1, . . . t and each j = 1, . . . , Oj — 1 choose a useful walk W'l ^ from f/j to the 
F-successor (Zj_,_j^)+ G L of Z^-j^^. 

• Choose a useful walk Wq from V* to the F-successor {Z\)'^ of Z\. 

• For each i = 1, . . . ,t — 1 choose a useful walk Wl' from to the F-successor {Z'^^'^)'^ 
oiZ{+\ 

• Choose a useful walk Wj-' from [7*^ to V* . 



• Define the shifted walks TV/^ := {Z]yC]Z]Z'C'U'U] for each i = and each 

j = 1 , . . . , Oj , where Cj is the F-cycle containing Zj and where C* is the F-cycle con- 
taining Z*. 

Then, as illustrated in Figure 4, we define 

:= W^W'l,Wl,Wl^Wl^ . . . Wl,^_^Wl,^WlWl^W'^^^ . . . W!,^,^_, . . . Wl,^_,Wl[,^W[' . 

So Wf/^'^ is a shifted walk from V* to itself. When transforming our walk W into a Hamilton 
cycle of G, for each i = 0, . . . , t — 1 we will replace the edge ZjZ* on W/^ by an edge of G 
entering and the edge C/'C/j on W['^ by an edge leaving u*. So we say that z*, ti* are 
prescribed endvertices for these particular occurrences of Z^^Z'^, WUj. 

Note that W^"''^ is composed of 1 + Y2l=i "^^i ^ ^tm ^ 907n walks, each using at most 8/7/' 
clusters (by definition of useful walks). Also Wf^°'^ is composed of at most \L\ ^ k further 
such walks. Using (^), we can choose Wf^°'^ and W^'^ such that the number of times they use 
Figure 4. A bad walk



every cluster outside Zj^ U is at most '^^^"'^it^^'^l^^ ^ < (say). Here is the set of 

predecessors of Zi^ on F . 

Let $V^{**} \in R$ be the cluster which contains the neighbour in $\mathrm{Match}_R$ of the first exceptional vertex of type $TR$ in our list (this exists, since $|TR| > |BL|$). Then $V^{**}$ is 4-excellent by Lemma 32. Define walks $W_R^{good}$ and $W_R^{bad}$ similarly to $W_L^{good}$ and $W_L^{bad}$, where $V^{**}$ now plays the role of $V^*$. (So both $W_R^{good}$ and $W_R^{bad}$ are walks from $V^{**}$ to itself.) Now we construct our final walk, which we will also call $W$, as follows. We start with our previous walk $W$ joining $U^*$ to $V^*$, then we add $W_L^{good} W_L^{bad}$ and replace the occurrence of $V^{**}$ mentioned above with $W_R^{good} W_R^{bad}$. We close up $W$ by adding a useful walk from $V^*$ to $(U^*)^+$, and then following the path in $F$ from $(U^*)^+$ to $U^*$. Our final walk $W$ has the properties listed below.

(a) For each cycle $C$ of $F$, $W$ visits every cluster of $C$ the same number of times, say $m_C$.

(b'') • $W$ enters every cluster of $R_{G''}$ at most $m$ times, and thus $W$ exits every cluster at most $m$ times.

• If $V \in Z_L \cup Z_R$, then $W$ enters $V$ precisely $m$ times and all the vertices of $V$ are prescribed endvertices for these $m$ entering edges of $W$. If $V \in Z_L^- \cup Z_R^-$, then $W$ exits $V$ precisely $m$ times and all the vertices of $V$ are prescribed endvertices for these $m$ exiting edges of $W$.

• If $V \notin Z_L \cup Z_R$ is not nearly 4-good then $W$ enters $V$ precisely $|V_{entry}|$ times and the set $V_{entry}$ is the set of prescribed endvertices for all these $|V_{entry}|$ entering edges of $W$. Similarly, if $V \notin Z_L^- \cup Z_R^-$ is not nearly 4-good then $W$ exits $V$ precisely $|V_{exit}|$ times and the set $V_{exit}$ is the set of prescribed endvertices for all these $|V_{exit}|$ exiting edges of $W$.

• If $V$ is nearly 4-good then $W$ enters $V$ between $|V_{entry}|$ and $|V_{entry}| + 2\gamma^{1/2} m$ times, the set $V_{entry}$ is the set of prescribed endvertices for $|V_{entry}|$ of these entering edges of $W$ and no vertex in $V$ is a prescribed endvertex for the other entering edges of $W$. The analogue holds for the exits of $W$ at $V$.

(c) $W$ visits every vertex of $V_0$ exactly once.

(d) For each $x_i \in V_0$ we can choose an inneighbour $x_i^-$ in the cluster preceding $x_i$ on $W$ and an outneighbour $x_i^+$ in the cluster following $x_i$ on $W$, so that as $x_i$ ranges over $V_0$ all vertices $x_i^+$, $x_i^-$ are distinct.

Recall that $V_0^*$ denotes the set of all endvertices of the matching edges in $\mathrm{Match}_B \cup \mathrm{Match}_T \cup \mathrm{Match}_L \cup \mathrm{Match}_R$ outside of $V_0$. Our aim now is to transform $W$ into a Hamilton cycle of $G$. We start by fixing edges in $G$ corresponding to all those edges of $W$ that lie in $R_{G''}$ but not in $F$. We first do this for all those occurrences $VU \in E(R_{G''}) \setminus E(F)$ of edges on $W$ for which there is no prescribed endvertex. Note that the second and third conditions in (b'') together imply that in this case both $V$ and $U$ must be nearly 4-good. Then, applying Lemma 12 as in Section 7, we can replace each such occurrence $VU$ by an edge from $V \setminus (X^* \cup V_0^*)$ to $U \setminus (X^* \cup V_0^*)$ in $G$, so that all the edges of $G$ obtained in this way are disjoint. We denote the set of edges obtained by $\mathcal{E}_1$. Next we choose the edge in $G$ for all those occurrences $VU \in E(R_{G''}) \setminus E(F)$ of edges on $W$ which have a prescribed endvertex. This can be achieved by the following greedy procedure. Suppose that we have assigned the endvertex $u \in U$ to $VU$. Then $V$ will be 4-excellent, so by the last condition in (b'') the number of endvertices in $V$ chosen for edges constructed in previous steps is bounded. But $u$ has at least $d'm/4$ inneighbours in $V$, where $d' \gg \gamma$, and $|V \cap (X^* \cup V_0^*)| \le 2\gamma m$, so we can replace $VU$ by $vu$ for some $v \in V \setminus (X^* \cup V_0^*)$ which is distinct from all the vertices chosen before. (This is the point where we need to work with $R_{G''}$ instead of $R_{G'}$: we have $d \ll \gamma$, and so the above argument would fail for $R_{G'}$.) We denote the set of edges obtained by $\mathcal{E}_2$.

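Let $\mathcal{E} := \mathcal{E}_1 \cup \mathcal{E}_2 \cup \mathcal{E}_3 \cup \mathcal{E}_4$, where $\mathcal{E}_3 := \mathrm{Match}_{BL}$ and $\mathcal{E}_4 := \mathrm{Match}_B \cup \mathrm{Match}_T \cup \mathrm{Match}_L \cup \mathrm{Match}_R$. (Note that $W$ used each edge in $\mathcal{E}_3 \cup \mathcal{E}_4$ precisely once.) For each cluster $V$ let $V_{Exit} \subseteq V$ be the subset of all initial vertices of edges in $\mathcal{E}$ and let $V_{Entry} \subseteq V$ be the subset of all final vertices of edges in $\mathcal{E}$. Then $V_{exit} \subseteq V_{Exit}$ and $V_{entry} \subseteq V_{Entry}$. The following lemma provides useful properties of these fixed edges.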

Lemma 36.

(i) $\mathcal{E}$ is a vertex-disjoint union of directed paths, each having at least one endvertex in a 4-excellent cluster.

Moreover, every cluster $V$ satisfies the following.

(ii) $|V_{Exit}| = |V^+_{Entry}|$.

(iii) If $V$ is nearly 4-good then $|V_{Exit}|, |V_{Entry}| \le 4\gamma^{1/2} m$, $(V \cap X^*) \setminus V_{entry} \subseteq V \setminus V_{Entry}$ and $(V \cap X^*) \setminus V_{exit} \subseteq V \setminus V_{Exit}$. Moreover, $V_{Exit} \cap V_{Entry} = V_{exit} \cap V_{entry}$.

(iv) If $V$ is nearly 4-good then the pairs $(V \setminus V_{Exit}, V^+ \setminus V^+_{Entry})_{G'}$ and $(V^- \setminus V^-_{Exit}, V \setminus V_{Entry})_{G'}$ are $(\sqrt{\varepsilon}, d^2)$-super-regular.

(v) There is a perfect matching from $V \setminus V_{Exit}$ to $V^+ \setminus V^+_{Entry}$.

Proof. By construction every vertex is the initial vertex of at most one edge in $\mathcal{E}$ and the final vertex of at most one edge in $\mathcal{E}$, so $\mathcal{E}$ is a disjoint union of directed paths and cycles. To prove statement (i), we note that $\mathcal{E}_1$ forms an independent set of edges in $\mathcal{E}$ and each edge in $\mathcal{E}_1$ has both endvertices in 4-excellent clusters. Moreover, every edge in $\mathcal{E}_2$ has a prescribed endvertex, and if $u \in U$ was prescribed for an edge $VU$ or $UV$ in $W$, then $V$ is a 4-excellent cluster and we chose $v \in V$ so that $uv$ is the only edge of $\mathcal{E}$ containing $v$. Thus any component of $\mathcal{E}$ containing an edge from $\mathcal{E}_1 \cup \mathcal{E}_2$ is a directed path having at least one endvertex in a 4-excellent cluster. Also, $\mathcal{E}_3$ and $\mathcal{E}_4$ are vertex-disjoint, so any component of $\mathcal{E}$ not containing an edge from $\mathcal{E}_1 \cup \mathcal{E}_2$ is either a component of $\mathcal{E}_3$, which has the required property by definition of 'pseudo-matching', or a directed path consisting of two edges of $\mathcal{E}_4$, which has the required property by Lemma 32. Thus statement (i) holds.

Condition (ii) follows immediately from our construction of W. The first part of (iii) follows
from the last part of (b'') and the definition of 4-good clusters. To check the remainder of (iii),
note that the last part of (b'') implies that the vertices in $V_{\mathrm{Exit}} \setminus V_{\mathrm{exit}}$ and in $V_{\mathrm{Entry}} \setminus V_{\mathrm{entry}}$
are endvertices of edges in $\mathcal{E}_1 \cup \mathcal{E}_3$, or non-prescribed endvertices of edges in $\mathcal{E}_2$. We chose the
endvertices of edges in $\mathcal{E}_1$ and the non-prescribed endvertices of edges in $\mathcal{E}_2$ to be disjoint from
each other and from $X^* \cup V_0^*$. Also, $V_0^* \cap X^* = \emptyset$ by definition of $X^*$ in ($\beta$) - see Subsection 8.3.
Altogether, this implies the remainder of (iii).

To prove the first part of (iv), note that (iii) and Lemma 33(iv) applied with $X' := V \setminus V_{\mathrm{Exit}}$
and $Y' := V^+ \setminus V^+_{\mathrm{Entry}}$ together imply that $(V \setminus V_{\mathrm{Exit}}, V^+ \setminus V^+_{\mathrm{Entry}})_{G'}$ is $(\sqrt{\varepsilon}, d^2)$-super-regular.
The second part of (iv) can be proved similarly. It remains to prove (v). If V is nearly 4-good
then (v) follows from (ii) and (iv) and Lemma 12. If $V^+ \in Z_L \cup Z_R$ then (v) is trivial since
$V_{\mathrm{Exit}} = V$ and $V^+_{\mathrm{Entry}} = V^+$ by the second condition in (b''). In all other cases the third
condition of (b'') implies that $V_{\mathrm{Exit}} = V_{\mathrm{exit}}$, and so $V^+_{\mathrm{Entry}} = V^+_{\mathrm{entry}}$ by (ii) and the fact that
$|V_{\mathrm{exit}}| = |V^+_{\mathrm{entry}}|$ by Lemma 33(iii). Thus in these cases (v) follows from Lemma 33(iii). □

Let $\mathcal{C}$ denote a spanning subdigraph of G whose edge set consists of $\mathcal{E}$ together with a perfect
matching from $V \setminus V_{\mathrm{Exit}}$ to $V^+ \setminus V^+_{\mathrm{Entry}}$ for every cluster V. Then $\mathcal{C}$ is a 1-factor of G. We
will show that by choosing a different perfect matching from $V \setminus V_{\mathrm{Exit}}$ to $V^+ \setminus V^+_{\mathrm{Entry}}$ for some
clusters V if necessary, we can ensure that $\mathcal{C}$ consists of only one cycle, i.e. that $\mathcal{C}$ is a Hamilton
cycle of G. First we show that if U is 4-good then we can merge most vertices of $U^-$, $U$ and $U^+$
into a single cycle of $\mathcal{C}$.
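
The effect of changing a perfect matching on the cycle structure of a 1-factor can be seen in a small sketch. The representation below (a successor map on vertices) and both function names are assumptions for illustration only; the sketch also ignores the requirement, essential in the proof, that the new edges actually lie in the relevant super-regular pair. It only shows the combinatorial fact that exchanging the out-edges of two vertices on different cycles merges those cycles.

```python
def cycles_of_one_factor(succ):
    """Return the cycles of a 1-factor given as a successor map v -> succ[v]."""
    seen = set()
    cycles = []
    for start in succ:
        if start in seen:
            continue
        cycle = []
        v = start
        # Follow out-edges until we return to a vertex already visited.
        while v not in seen:
            seen.add(v)
            cycle.append(v)
            v = succ[v]
        cycles.append(cycle)
    return cycles


def swap_out_edges(succ, u, v):
    """Return a new 1-factor in which u and v exchange their out-edges."""
    new_succ = dict(succ)
    new_succ[u], new_succ[v] = succ[v], succ[u]
    return new_succ


# Toy 1-factor (assumed data): a 3-cycle 0->1->2->0 and a 2-cycle 3->4->3.
one_factor = {0: 1, 1: 2, 2: 0, 3: 4, 4: 3}
merged = swap_out_edges(one_factor, 2, 4)
```

Exchanging the out-edges of two vertices that already lie on the same cycle splits that cycle instead, which is why the proof has to choose the new matching edges with care.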

Lemma 37. We can choose the perfect matchings from $V \setminus V_{\mathrm{Exit}}$ to $V^+ \setminus V^+_{\mathrm{Entry}}$ (for all
clusters V) so that the following holds for every cluster U.

(i) If U is nearly 4-good then all vertices in $U \setminus (U_{\mathrm{Exit}} \cap U_{\mathrm{Entry}})$ lie on a common cycle $C_U$
of $\mathcal{C}$.

(ii) If U is 4-good then $C_{U^-} = C_U = C_{U^+}$.

Proof. Recall that if U is nearly 4-good then $|U_{\mathrm{Exit}}|, |U_{\mathrm{Entry}}| \le 4\gamma^{1/2} m$ by Lemma 36(iii).
Then it is clear that (i) implies (ii), so it suffices to prove (i). We will consider each nearly
4-good cluster U in turn and show that we can change the perfect matchings from $U^- \setminus U^-_{\mathrm{Exit}}$
to $U \setminus U_{\mathrm{Entry}}$ and from $U \setminus U_{\mathrm{Exit}}$ to $U^+ \setminus U^+_{\mathrm{Entry}}$ to ensure that U satisfies the conclusions of
the lemma, and moreover, every V satisfying the conclusions continues to satisfy them. First
we choose the perfect matching from $U^- \setminus U^-_{\mathrm{Exit}}$ to $U \setminus U_{\mathrm{Entry}}$ to achieve the following.

(1) All vertices which were on a common cycle in $\mathcal{C}$ are still on a common cycle.

(2) All vertices in $(U^- \setminus U^-_{\mathrm{Exit}}) \cup (U \setminus U_{\mathrm{Entry}})$ lie on a common cycle in $\mathcal{C}$.

Since $(U^- \setminus U^-_{\mathrm{Exit}}, U \setminus U_{\mathrm{Entry}})_{G'}$ is $(\sqrt{\varepsilon}, d^2)$-super-regular by Lemma 36(iv), this can be achieved
by the same argument used to prove statement (†) at the end of Section 7. Next we apply
the same argument to merge all the cycles in (the new) $\mathcal{C}$ meeting $U^+ \setminus U^+_{\mathrm{Entry}}$ into a single
cycle, which then also contains all the vertices in $U \setminus U_{\mathrm{Exit}}$. Since $|U_{\mathrm{Exit}}|, |U_{\mathrm{Entry}}| \le 4\gamma^{1/2} m$
by Lemma 36(iii), we have that $U_{\mathrm{Exit}} \cup U_{\mathrm{Entry}} \ne U$. Thus the 1-factor $\mathcal{C}$ obtained in this way
satisfies (1) and (i) for the cluster U. Continuing in this way for all 4-good clusters and their
F-neighbours yields a 1-factor $\mathcal{C}$ as required in (i). □

We will now show that any 1-factor $\mathcal{C}$ as in Lemma 37 must consist of a single cycle (and
thus must be a Hamilton cycle of G).

Lemma 38. Every 1-factor $\mathcal{C}$ of G as in Lemma 37 satisfies the following conditions.

(i) For every cycle C of $\mathcal{C}$ there exists a 4-good cluster $U \in L \cup R$ such that $C = C_U$, i.e.
C contains all vertices in $U \setminus (U_{\mathrm{Exit}} \cap U_{\mathrm{Entry}})$.

(ii) There is one cycle $C_L$ in $\mathcal{C}$ which contains $U \setminus (U_{\mathrm{Exit}} \cap U_{\mathrm{Entry}})$ for every 4-good cluster
$U \in L$. Similarly, there is some cycle $C_R$ in $\mathcal{C}$ which contains $U \setminus (U_{\mathrm{Exit}} \cap U_{\mathrm{Entry}})$ for
every 4-good cluster $U \in R$.

(iii) $\mathcal{C} = C_L = C_R$ (and thus $\mathcal{C}$ is a Hamilton cycle of G).
Proof. Let C be a cycle of $\mathcal{C}$. First consider the case when C contains at least one edge in $\mathcal{E}$,
and let P be the longest subpath of C which only consists of edges in $\mathcal{E}$. By Lemma 36(i) P has an
endvertex x lying in a 4-excellent (and so 4-good) cluster U. But x cannot lie in both $U_{\mathrm{Exit}}$ and
$U_{\mathrm{Entry}}$ by the maximality of the path. Therefore $x \in C_U$, and so $C = C_U$. Now suppose that C does
not contain any edges of $\mathcal{E}$. This means that there is some cycle $C' \in F$ such that C 'winds
around' $C'$, i.e. it only uses clusters in $C'$ and in each step moves from V to $V^+$ for some
$V \in C'$. We claim that $C'$ cannot be bad. Otherwise, it would contain a cluster $V \in Z_L \cup Z_R$.
But then the second part of (b'') implies that $V_{\mathrm{Entry}} = V$, so C cannot 'wind around' $C'$. (The
purpose of the walks $W_L^{\mathrm{bad}}$ and $W_R^{\mathrm{bad}}$ was to exclude this possibility.) Thus $C'$ is not bad, so
contains a 4-good cluster $U \in L \cup R$, which C must meet in at least one vertex, u say. But
$u \notin U_{\mathrm{Exit}} \cup U_{\mathrm{Entry}}$ (otherwise one edge at u on C would lie in $\mathcal{E}$). Now Lemma 37(i) implies
that C contains all vertices in $U \setminus (U_{\mathrm{Exit}} \cap U_{\mathrm{Entry}})$, as required.

To prove the first part of (ii), consider the walk $W_L^{\mathrm{good}}$ which connected all 4-good clusters
in L. Let $W_L^{\mathrm{good}} = X_1 C_1 X_1^- X_2 C_2 X_2^- \dots X_t C_t X_t^- X_{t+1}$, where $X_1 = X_{t+1} = V^*$. Then each
4-good cluster in L appears at least once in $X_1, \dots, X_t$ and $W_L^{\mathrm{good}}$ only uses nearly 4-good
clusters. Let $x_i^- x_{i+1}$ be the edge in $\mathcal{E}$ that we have chosen for the edge $X_i^- X_{i+1}$ on $W_L^{\mathrm{good}}$. As
neither $x_i^-$ nor $x_{i+1}$ was a prescribed endvertex for $X_i^- X_{i+1}$, we have $x_i^- \in (X_i^-)_{\mathrm{Exit}} \setminus (X_i^-)_{\mathrm{exit}}$
and $x_i \in (X_i)_{\mathrm{Entry}} \setminus (X_i)_{\mathrm{entry}}$. Thus $x_i^- \notin (X_i^-)_{\mathrm{Entry}}$ and $x_i \notin (X_i)_{\mathrm{Exit}}$ by Lemma 36(iii). So
Lemma 37(ii) implies that for each $i = 2, \dots, t$ the vertices $x_i^-$ and $x_i$ lie on the same cycle of $\mathcal{C}$.
Trivially, $x_i^-$ and $x_{i+1}$ also lie on the same cycle for each $i = 1, \dots, t$. This means that all of
$x_2, \dots, x_{t+1}, x_1^-, \dots, x_t^-$ lie on the same cycle of $\mathcal{C}$, which we will call $C_L$. This in turn implies
that $C_L$ contains $U \setminus (U_{\mathrm{Exit}} \cap U_{\mathrm{Entry}})$ for every 4-good cluster $U \in L$. Indeed, $U = X_i$ for some
$i = 2, \dots, t+1$ and $x_i \in U_{\mathrm{Entry}} \setminus U_{\mathrm{Exit}} \subseteq U \setminus (U_{\mathrm{Exit}} \cap U_{\mathrm{Entry}})$. As $x_i \in C_L$, Lemma 37(i) implies
that $C_L$ contains all vertices in $U \setminus (U_{\mathrm{Exit}} \cap U_{\mathrm{Entry}})$. A similar argument for $W_R^{\mathrm{good}}$ establishes
the existence of $C_R$.

To verify (iii), consider an exceptional vertex x of type TR (which exists since we are assuming
that $|TR| > |BL|$). Let $x^-$ and $x^+$ be the neighbours of x on the cycle $C \in \mathcal{C}$ which contains x.
Let X be the cluster containing $x^-$ and $X'$ be the cluster containing $x^+$. By Lemma 32, $X \in T$,
$X' \in R$ and both X and $X'$ are 5-excellent (and thus 4-good). Since $x^+ \in X'_{\mathrm{Entry}} \setminus X'_{\mathrm{entry}}$,
and so $x^+ \in X' \setminus (X'_{\mathrm{Exit}} \cap X'_{\mathrm{Entry}})$ by Lemma 36(iii), we must have $C = C_R$. But on the other
hand, $x^- \in X \setminus (X_{\mathrm{Exit}} \cap X_{\mathrm{Entry}})$ and $X^+ \in L$ is 4-good (since X is 5-excellent). Together with
Lemma 37(ii) this implies that C contains $X^+ \setminus (X^+_{\mathrm{Exit}} \cap X^+_{\mathrm{Entry}})$, i.e. $C = C_L = C_R$. Together
with (i) this now implies that $\mathcal{C} = C$, as required. □

10.2. The case when (**) holds. Recall that (**) is the case when $|M| \ge |V_0|/\gamma^3$. We only
consider the case when $|TR| \ge |BL|$, as the argument for the other case is similar. Let $F_{RL}$
denote the set of all those cycles in F which avoid all the clusters in $L \cup R$ and contain more
clusters from My^ than from My^. Let Flr denote the set of all other cycles in F which avoid 
all the clusters in $L \cup R$.

We divide the argument in this subsection into two cases. The main case is when \TR\ — 
\BL\ > \Frl\ + |i*L_R|- We start by showing that we have at least - — jL-J^ — - transitions from B 
to L. Note that \Mv\ ^ |M|/2 ^ \\^\/i2-/^). If |M^^| ^ 1^01/(47^) then, since \TR\ - \BL\ < 
I Vol, we can use vertices in My^ UM^^ as in Subsection 18.51 to obtain the required transitions. 
On the other hand, if |My^| ^ |Vb|/(473) then |My^| ^ |Vb|/(473). Applying Lemma [3¥ i) 
and recalling |Vo| ^ d^^'^n <^ ^^n we obtain 



iMatchBLl ^ min{|M^«|/2, 7^n} - |M^^| - l^ol ^ ^ " l^y"'! " l^ol ^ '^^j^^f^' - |M#^|. 

Thus the pseudo-matching Match^i and the vertices in My^ U M^^ together give at least 
"^^^07^^^ transitions from B to L. 
We claim we can choose a sub-pseudo- matching Match^^ of Match^i and a set Entry ^
My^ with the following properties. 

(i) iMatch's^l + lEntry^il = |ri?| - \BL\ + \Flr\, 

(ii) No cluster contains more than jm/2 endpoints of Match^^. more than 7m/2 vertices 
in Entryj^^. 

(iii) Every cycle in contains at least one vertex of Entry 

To see this, we first choose a vertex in My^ on every cycle in Fjn to include in Entry ^j;^, which 
is possible since \TR\ — \BL\ ^ |i*i?L|- Next we arbitrarily discard one edge from each 2-edge 
path in Match^i to obtain a matching, and then consider a random submatching in which 
every edge is retained with probability 7/4. As in Case 1 of the proof of Lemma [Ml with high 
probability we obtain a submatching of size at least ^ | Match^^. I with at most jm/2 endpoints 
in any cluster. We also arbitrarily choose vertices in each cluster of My^. Then we still 

have at least 7~"'^(|Ti2| — |-B-L|) transitions. Since \TR\ — \BL\ ^ I-FlhI? we can arbitrarily choose 
some of these transitions so that iMatch^^.! + l^ntry^^.! = l^^l ~ \BL\ + |F/^_r|. 
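
The random thinning step in this paragraph can be sketched as follows. This is an illustration under stated assumptions, not the paper's construction: the function names, the representation of a matching as a list of vertex pairs, the `cluster_of` map and the toy data are all hypothetical, and the retention probability is passed in as a parameter `p` (the text takes it to be $\gamma/4$).

```python
import random
from collections import Counter


def random_submatching(matching, p, rng):
    """Keep each edge of the matching independently with probability p."""
    return [edge for edge in matching if rng.random() < p]


def max_endpoints_per_cluster(matching, cluster_of):
    """Largest number of matching endpoints that fall in a single cluster."""
    counts = Counter()
    for x, y in matching:
        counts[cluster_of[x]] += 1
        counts[cluster_of[y]] += 1
    return max(counts.values(), default=0)


# Toy data (assumed): 50 disjoint edges, vertices spread over 5 clusters.
matching = [(i, i + 100) for i in range(50)]
cluster_of = {v: v % 5 for edge in matching for v in edge}
sub = random_submatching(matching, 0.25, random.Random(0))
load = max_endpoints_per_cluster(sub, cluster_of)
```

A concentration argument, as in the proof of the lemma cited in the text, is what guarantees that with high probability the submatching is both large enough and has few endpoints in every cluster; the sketch only performs the sampling and reports the resulting load.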

Note that there are no clusters which are M^^-full or full with respect to Match^^. In 
particular, every cluster is 4-good. Next we choose twins as in Subsection 18.51 so that the 
properties in Lemma[551fii)-(iv) hold. Thus we obtain sets ExitsL, Entry^^.; Exit^/'^j Entry^", 
Entry Entry^j^™ as before. 

We now proceed similarly as in Subsection 10.1, forming a walk W that incorporates all
the exceptional vertices and uses $|TR| - |BL|$ transitions, ending in some 4-excellent cluster
$V^* \in L$. Since all clusters are 4-good, the bad cycles are precisely those in $F_{LR} \cup F_{RL}$. Now we
cannot construct the walks $W_L^{\mathrm{bad}}$ and $W_R^{\mathrm{bad}}$ as before, since the bad cycles avoid $L \cup R$. Instead,
we enlarge W by including a walk $W_{LR}$ which 'connects' all the cycles in $F_{LR}$. Suppose that
Ci, . . . ,Ct are the cycles in Flr and choose a cluster Vi G M^^ on each Cj. Lemma [28]^i) implies 
that, in $R_{G''}$, each $V_i^+$ has many (at least $\gamma k/4$) 4-excellent inneighbours in T, while each $V_i$
has many 4-excellent outneighbours in R. We pick 4-excellent inneighbours $X_i \in T$ of $V_i^+$ and
4-excellent outneighbours $Y_i \in R$ of $V_i$ for each i. Now we construct $W_{LR}$ as follows. We start
at $V^* \in L$, follow a useful shifted walk to $X_1^+ \in L$, then the path in F from $X_1^+$ to $X_1 \in T$ and
then use the edge $X_1 V_1^+$. Next we wind around $C_1$ to $V_1$, use the edge $V_1 Y_1$ and follow a useful
shifted walk from $Y_1$ to one of the $|F_{LR}|$ transitions from B to L that we have not yet used to
move back to L. We continue in this way until we have 'connected' all the $C_i$. Finally, we close
$W_{LR}$ by following a useful shifted walk back to $V^*$.

As in Subsection 10.1 we fix the edges $\mathcal{E}$ and choose matchings from $V \setminus V_{\mathrm{Exit}}$ to $V^+ \setminus V^+_{\mathrm{Entry}}$
for each cluster V to obtain a 1-factor $\mathcal{C}$. Note that by construction every vertex outside of $V_0$
is incident to at most one edge of $\mathcal{E}$, so $V_{\mathrm{Exit}} \cap V_{\mathrm{Entry}} = \emptyset$ for each cluster V. Lemmas 36 and 37
still hold, but instead of Lemma 38(i) we now only have that for every cycle C of $\mathcal{C}$ there exists
a cluster U (which is automatically 4-good) such that C contains all the vertices in U. We then
deduce that $\mathcal{C}$ has a cycle $C_L$ containing all vertices in clusters of L and a cycle $C_R$ containing
all vertices in clusters of R. Moreover, since we use at least one transition from B to L we have
$C_L = C_R =: C'$. Lemma 37 now implies that for every cycle D in F there is a cycle $C''$ in $\mathcal{C}$ such
that $C''$ contains all vertices belonging to clusters in D. In particular, $C'$ contains all vertices
belonging to clusters which lie on an F-cycle that intersects $L \cup R$.

Moreover, if $C'' \ne C'$ is another cycle in $\mathcal{C}$, then there must be a cycle D in F such that D
only consists of clusters from $M_V$ and such that $C''$ contains all vertices in U for all clusters U
on D. If $D \in F_{RL}$ then our choice of $\mathrm{Entry}'_{BL}$ implies that some such cluster U on D contains
a vertex $x \in \mathrm{Entry}'_{BL}$. The inneighbour y of x in $\mathcal{C}$ lies in some cluster $Y \in B$, and so $Y^+$ lies
in R. But $C'$ contains all vertices belonging to clusters that lie on an F-cycle which intersects
R. So $C'$ contains all vertices in Y, and thus contains x. This shows that $C'$ contains all those
vertices which belong to clusters lying on cycles from $F_{RL}$. On the other hand, the walk $W_{LR}$
ensures that $C'$ also contains all those vertices which belong to clusters lying on cycles from $F_{LR}$.
Altogether this shows that $\mathcal{C} = C'$ is a Hamilton cycle, as required.
It remains to consider the case when $0 \le |TR| - |BL| \le |F_{RL}| + |F_{LR}|$. We claim that there

are at least m/4 transitions from T to R and at least m/4 transitions from B to L. If My^ ^ 0, 
then we can use the vertices in My^ U to get at least m transitions from B to L. On the 
other hand, if M^^ = then \M^^\ ^ |M|/2 ^ \Vo\/{2'y'^) by so Lemma [Mli) implies 

the existence of a pseudo-matching from i? to L of size at least min{|My^|/2, 7^n} — |Vo| ^ 
min{|M^^|/4,7^n/2} ^ m/4. Similarly, if M^^ / then the vertices in M^^ U Mjf- give at 
least m transitions from T to R. On the other hand, if My^ = then Lemma IMlfii) implies 
the existence of a pseudo-matching from T to R of size at least m/4. Thus in all cases, we have
at least m/4 transitions in both directions. We can use these transitions to argue similarly as
in the first case when $|TR| - |BL| > |F_{RL}| + |F_{LR}|$, but this time we also include a walk $W_{RL}$
into W which 'connects' all the cycles in $F_{RL}$. Since $|TR| - |BL| \le |F_{RL}| + |F_{LR}| \le k \ll m$
there are more than enough transitions. Moreover, if $|F_{RL}| + |F_{LR}| = 0$, then we also make sure
that W follows at least one transition from B to L (and thus at least one transition from T
to R). Then we find a Hamilton cycle by the same argument as above.

10.3. The case when $|TR| = |BL|$ and (*) holds. If $|TR| = |BL| \ge 1$ then we can use
the same procedure as in Subsection 10.1, with no need to use any edges from $\mathrm{Match}_{BL}$ or
$\mathrm{Match}_{TR}$. For the remainder of the proof we consider the case when $|TR| = |BL| = 0$. In this
case, our list of the vertices of $V_0$ has all vertices of type TL followed by all vertices of type BR,
so we will need to make a transition from incorporating vertices of type TL to type BR and
then another transition back from type BR to type TL, i.e. we need one transition from T to R
and another one from BtoL. If M^^ and M^^ are both non-empty, then a similar argument 
as in the second case of the previous subsection implies that there are at least m/4 transitions 
from B to L and at least m/4 transitions from T to R. 

Thus we may suppose that at least one of My^, My^ is empty. We only consider the case 
when My^ = (and My^ could be empty or non-empty), as the other case is similar. Let xi 
and $x_2$ be the first and last vertices of the TL list and $y_1$ and $y_2$ the first and last vertices of
the BR list. Let $X_2 \in L$ be the cluster containing the outneighbour of $x_2$ in $\mathrm{Match}_L$, $Y_1 \in B$
the cluster containing the inneighbour of $y_1$ in $\mathrm{Match}_B$, $Y_2 \in R$ the cluster containing the
outneighbour of $y_2$ in $\mathrm{Match}_R$, and $X_1 \in T$ the cluster containing the inneighbour of $x_1$ in
$\mathrm{Match}_T$. We need to construct portions of our walk W that link $X_2$ to $Y_1$ and $Y_2$ to $X_1$. (If
for instance the TL list is empty, we take $X_1$ to be an arbitrary cluster in T.) We consider
two subcases according to whether there is a cycle of F that contains both a cluster of L and a
cluster of R.

Case 1. There is a cycle C of F containing a cluster $X \in L$ and a cluster $Y \in R$.

In this case we can reroute along C to construct the required transitions. Let $W_1$ be a shifted
walk from $Y_2$ to $Y_1^+$. This exists since $Y_2, Y_1^+ \in R$. Also, since C contains $Y \in R$ we can choose
the walk $W_1$ to go via Y, and C will be one of the cycles traversed. Similarly we can choose a
shifted walk $W_2$ from $X_2$ to $X_1^+$, and we can choose $W_2$ to go via X, so it also traverses C. Now
we construct the portions of the walk W joining $x_2$ to $y_1$ and $y_2$ to $x_1$ as follows. To join $x_2$ to
$y_1$ we start at $X_2$, follow $W_2$ until it reaches X, then follow C round to $Y^-$, and then switch to
$W_1$, which takes us to $Y_1^+$, and we end by traversing a cycle of F to reach $Y_1$. This is balanced
with respect to all cycles of F except for C, where we have only used the portion from X to
$Y^-$. To join $y_2$ to $x_1$ we start at $Y_2$, follow $W_1$ until it reaches Y, then follow C round to $X^-$,
and then switch to $W_2$, which we follow to $X_1^+$, then traverse a cycle of F to reach $X_1$. This
is balanced with respect to all cycles of F except for C, where it only uses the portion from Y
to $X^-$. But this is exactly the missing portion from the first transition, so in combination they
are balanced with respect to all cycles of F. This scenario is illustrated in Figure 5.

Case 2. No cycle of F contains clusters from both L and R.

First we observe that in this case we have $B \subseteq R \cup M_V$ and $T \subseteq L \cup M_H$. To see the first
inclusion, note that if $U \in B$ then $U^+ \in R$, so we cannot have $U \in L$ by our assumption for
this case. The second inclusion is similar. Since $|L|, |R| = n/2 \pm 19\eta n$ by Lemma 27 and since
$\delta^0(G) \ge \beta n$, every vertex in $V_0$ has either at least $\beta n/3$ inneighbours in T or at least $\beta n/3$
inneighbours in B (and similarly for outneighbours in L and R). So by swapping the types of
the exceptional vertices between TL and BR if necessary, we may assume that each $v \in TL$
has either at least $\beta n/3$ inneighbours in T or at least $\beta n/3$ outneighbours in L (or both), and
similarly for the exceptional vertices of type BR.

Recall that My^ is empty. We consider two further subcases according to whether or not 
My^ is also empty. 

Case 2.1 M^^ / 0. 

We start by choosing an edge $yy'$ from $B \cup BR$ to $L \cup TL$ such that $y \in B \setminus V_0^*$ or $y' \in L \setminus V_0^*$ (or
both). Such an edge exists by Lemma 35(v), since $|V_0^*| = 2|V_0| \ll \beta n$. If both $y \in B \setminus V_0^*$ and
$y' \in L \setminus V_0^*$, then we can use $yy'$ for the transition from B to L. Together with suitable useful
shifted walks in L and in R this will achieve the transition from $y_2$ to $x_1$. For the transition
from T to R we use one vertex in My^, together with a twin of this vertex in M^^. Now we 
may suppose that $y \notin B \setminus V_0^*$ or $y' \notin L \setminus V_0^*$. We only consider the case when the former holds,
as the other case is similar. We will still aim to use $yy'$ for the transition from B to L, although
we need to make the following adjustments according to various cases for y.

If $y \in BR$ then we relabel the BR list so that $y_2 = y$. We can then use the edge $yy'$ (together
with a suitable useful shifted walk in L) to obtain a transition from $y_2$ to $x_1$.

Suppose next that some edge in $\mathrm{Match}_R$ joins an exceptional vertex $u \in BR$ to y. If u has an
outneighbour $v \in R \setminus (V_0^* \cup \{y, y'\})$, then we replace the edge uy by uv and can now use $yy'$ to
obtain a transition from B to L. If u has no such outneighbour, then u must have at least $\beta n/3$
inneighbours in B (by our assumption on the exceptional vertices at the beginning of Case 2)
as well as at least $\beta n/3$ outneighbours in L. Pick such an inneighbour $u^- \in B \setminus V_0^*$ and such
an outneighbour $u^+ \in L \setminus V_0^*$. We can now use the path $u^- u u^+$ to obtain a transition from B
to L.

Finally, suppose that some edge in $\mathrm{Match}_B$ joins y to an exceptional vertex $u \in BR$. If u
has an inneighbour $v \in B \setminus (V_0^* \cup \{y, y'\})$, then we replace the edge yu by vu and can now use
$yy'$ to obtain a transition from B to L. If u has no such inneighbour, then u must have at least
$\beta n/3$ inneighbours in T. Pick such an inneighbour $u^- \in T \setminus (V_0^* \cup \{y, y'\})$ and let $u^+ \in R$ be
the outneighbour of u in the matching $\mathrm{Match}_R$. We now use $yy'$ to obtain a transition from B
to L and the path $u^- u u^+$ to obtain a transition from T to R.

(Note that y cannot be an endvertex of some edge in $\mathrm{Match}_T$ or $\mathrm{Match}_L$ outside $V_0$, since
$y \in B \cup BR$ and $B \subseteq R \cup M_V$.)

Case 2.2 M^^ = 0. 

In this case we have $M_V = M_H = \emptyset$. Thus $B \subseteq R$, and since $|B| = |R|$ we have $B = R$.
Similarly $L = T$.

Next suppose that there are exceptional vertices $v_1 \ne v_2$ such that

$v_1$ has at least $\beta n/3$ inneighbours in T and at least $\beta n/3$ outneighbours in R. $(\heartsuit_1)$
$v_2$ has at least $\beta n/3$ inneighbours in B and at least $\beta n/3$ outneighbours in L. $(\heartsuit_2)$

Then we use $v_1$ to get a transition from T to R, and use $v_2$ to get a transition from B to L.
Now we may suppose that we cannot find $v_1$ and $v_2$ as above. It may even be that we can find
neither $v_1$ nor $v_2$ as above, in which case the following holds.

Every vertex of type TL has at least $\beta n/3$ inneighbours in T and at least $\beta n/3$
outneighbours in L. Every vertex of type BR has at least $\beta n/3$ inneighbours in B and at
least $\beta n/3$ outneighbours in R. $(\diamondsuit)$

For the remainder of the proof we suppose that either $(\heartsuit_1)$ or $(\diamondsuit)$ holds, as the case $(\heartsuit_2)$ is
similar to $(\heartsuit_1)$. We consider the partition $V(G) = L' \cup R'$ where $L' := L \cup TL$ and $R' := R \cup BR$.
When $(\heartsuit_1)$ holds we add $v_1$ to either $L'$ or $R'$ such that the sets obtained in this way have
different size. We still denote these sets by $L'$ and $R'$. The following lemma will supply the
required transitions.

Lemma 39.

(i) When $(\heartsuit_1)$ holds there is an edge $yy'$ from $R' \setminus \{v_1\}$ to $L' \setminus \{v_1\}$ having at least one
endvertex in $(L \cup R) \setminus V_0^*$.

(ii) When $(\diamondsuit)$ holds there are two disjoint edges $xx'$ and $yy'$ with $x, y' \in L'$, $x', y \in R'$ such
that both edges have at least one endpoint in $(L \cup R) \setminus V_0^*$.

Proof. Suppose first that we have $d^+_{(1-\beta)n/2} \ge (1+\beta)n/2$. Then we have at least $(1+\beta)n/2$
vertices of outdegree at least $(1+\beta)n/2$. Since $|L|, |R| = n/2 \pm 19\eta n$ (by Lemma 27) and
$|V_0^*| \ll \beta n$, we can choose vertices $x \in L \setminus V_0^*$ and $y \in R \setminus V_0^*$ with outdegree at least $(1+\beta)n/2$,
and then outneighbours $x' \ne y$ in $R \setminus V_0^*$ of x and $y' \ne x$ in $L \setminus V_0^*$ of y. Then $xx'$ and
$yy'$ are the edges required in (ii) and $yy'$ is the edge required in (i). A similar argument
applies if $d^-_{(1-\beta)n/2} \ge (1+\beta)n/2$. Therefore we can assume that $d^+_{(1-\beta)n/2} < (1+\beta)n/2$ and
$d^-_{(1-\beta)n/2} < (1+\beta)n/2$. Now our degree assumptions give $d^-_{(1-\beta)n/2} \ge n/2$ and $d^+_{(1-\beta)n/2} \ge n/2$,
so there are at least $(1+\beta)n/2$ vertices of outdegree at least n/2 and at least $(1+\beta)n/2$ vertices
of indegree at least n/2. We consider cases according to the size of $L'$ and $R'$.

If $|R'| > |L'|$ then we have sets $X, Y \subseteq L \setminus V_0^*$ with $|X|, |Y| \ge \beta n/3$ such that every vertex
in X has at least 2 outneighbours in $R'$ and every vertex in Y has at least 2 inneighbours
in $R'$. Then we can obtain the required edges greedily: if $(\diamondsuit)$ holds, choose any $x \in X$, an
outneighbour $x' \in R'$ of x, any $y' \in Y$ with $y' \ne x$ and any inneighbour $y \in R'$ of $y'$ with
$y \ne x'$; if $(\heartsuit_1)$ holds, then we choose $y \in R' \setminus \{v_1\}$ and an outneighbour $y' \in L' \setminus \{v_1\}$. A similar
argument applies when $|L'| > |R'|$.

Finally we have the case $|L'| = |R'| = n/2$, in which case $(\diamondsuit)$ holds by definition of $L'$ and
$R'$. Then we have sets $X \subseteq L \setminus V_0^*$ and $Y \subseteq R \setminus V_0^*$ of vertices with outdegree at least n/2, with
$|X|, |Y| \ge \beta n/3$. Note that each $x \in X$ has at least one outneighbour in $R'$ and each $y \in Y$
has at least one outneighbour in $L'$. Choose some $x_0 \in X$ and an outneighbour $x_0' \in R'$ of $x_0$.
If there is any $y \in Y$, $y \ne x_0'$ with an outneighbour $y' \ne x_0$ in $L'$ then $x_0 x_0'$ and $yy'$ are our
required edges. Otherwise, we have the edge $y x_0$ for every $y \in Y$ with $y \ne x_0'$. So we choose
some other $x \in X$ with $x \ne x_0$, an outneighbour $x' \in R'$ of x and a vertex $y \in Y \setminus \{x_0', x'\}$, and
our required edges are $xx'$ and $y x_0$. □

Now suppose that $(\heartsuit_1)$ holds. Let $yy'$ be the edge provided by Lemma 39(i). If $y \in R \setminus V_0^*$
and $y' \in L \setminus V_0^*$, then we can use $yy'$ for the transition from $B = R$ to L. For the transition
from $T = L$ to R we use a path $v_1^- v_1 v_1^+$ such that $v_1^- \in L \setminus (V_0^* \cup \{y, y'\})$ and $v_1^+ \in R \setminus (V_0^* \cup \{y, y'\})$.
(Such a path exists since $v_1$ has many inneighbours in L and many outneighbours in R.) Now
we may suppose that either $y \notin R \setminus V_0^*$ or $y' \notin L \setminus V_0^*$. We only consider the case when the former
holds, as the other case is similar. We still aim to use $yy'$ for the transition from B to L, although
we need to make adjustments as in Case 2.1. For example, consider the case when some edge in
$\mathrm{Match}_B$ joins y to an exceptional vertex $u \in BR$. If u has an inneighbour $v \in B \setminus (V_0^* \cup \{y, y'\})$,
then we replace the edge yu by vu and can now use $yy'$ to obtain a transition from B to L. If u
has no such inneighbour, then u must have at least $\beta n/3$ inneighbours in $T = L$. Choose such
an inneighbour $u^- \in T \setminus (V_0^* \cup \{y, y'\})$ and let $u^+$ be the outneighbour of u in $\mathrm{Match}_R$. Add
$v_1$ to the set TL, BR which contained it previously. We can now use $yy'$ to obtain a transition
from B to L and the path $u^- u u^+$ to obtain a transition from T to R. The other cases are
similar to those in Case 2.1.

Finally, suppose that $(\diamondsuit)$ holds. Let $xx'$, $yy'$ be the edges provided by Lemma 39(ii). Then
by changing the edges in $\mathrm{Match}_L \cup \mathrm{Match}_R \cup \mathrm{Match}_T \cup \mathrm{Match}_B$ if necessary we can ensure that
$x, x', y, y' \notin V_0^* \setminus V_0$. Now we can use $xx'$ for the transition from T to R and $yy'$ for the transition
from B to L. This is clear if none of $x, x', y, y'$ lies in $V_0$. But if we have $x \in V_0$ (for example),
then $x \in TL$, and relabelling the TL list so that $x = x_2$ we can use $xx'$ for the transition from $x_2$
to $y_1$. The other cases are similar.

In all of the above cases for $|TR| = |BL| = 0$ we obtain a transition from B to L and a transition
from T to R. Now we can complete the proof as in Subsection 10.1. Here no cluster is full,
and so every cluster is 4-good. Moreover, every vertex outside $V_0$ is an endvertex of at most
one edge in $\mathcal{E}$. Thus as in Lemma 38 one can show that there are cycles $C_L, C_R \in \mathcal{C}$ such that
$C_L$ contains all vertices belonging to clusters in L, $C_R$ contains all vertices belonging to clusters
in R and every exceptional vertex lies in $C_L$ or $C_R$. Moreover, since every cluster is 4-good and
every vertex outside $V_0$ is an endvertex of at most one edge in $\mathcal{E}$, Lemma 37 now implies that
for every cycle D in F there is a cycle $C'$ in $\mathcal{C}$ such that $C'$ contains all vertices belonging to
clusters in D. Now considering the transition from B to L (say) we see that $C_L = C_R$. Since
(*) implies that every cycle of F contains at least one cluster from $L \cup R$, we also have that all
vertices in M are contained in $C_L = C_R$. Thus $\mathcal{C} = C_L = C_R$ is a Hamilton cycle of G. This
completes the proof of Theorem 5.



11. A concluding remark

The following example demonstrates that the degree properties used in our main theorem
cannot be substantially improved using our current method. Let G be a digraph with $V(G) =
\{1, \dots, n\}$ such that $ij \in E(G)$ for every $1 \le i < j \le n$ and also for every $1 \le j < i \le \alpha n + 1$ and
$n - \alpha n \le j < i \le n$, for some $0 < \alpha < 1/2$. Then G has minimum semidegree $\alpha n$ and satisfies
$d_i^+, d_i^- \ge i - 1$ for all $1 \le i \le n$, so if we apply the regularity lemma it is 'indistinguishable'
from a digraph satisfying the hypotheses of Conjecture 2. However, any disjoint union of cycles
in G covers at most $2\alpha n$ vertices, so the argument used in Section 6 breaks down. We remark
that our argument may well still be useful in combination with a separate method for treating
the case when the reduced digraph cannot be nearly covered by disjoint cycles.
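
The degree claims for this example can be checked numerically on a small instance. The sketch below is only an illustration: the function names are hypothetical, and it assumes that $\alpha n$ is an integer `a` with `a + 1 < n - a`, so that the two blocks carrying backward edges are disjoint.

```python
def example_digraph(n, a):
    """Edges ij for i < j, plus backward edges inside {1..a+1} and {n-a..n}."""
    edges = set()
    for i in range(1, n + 1):
        for j in range(1, n + 1):
            if i == j:
                continue
            forward = i < j
            back_low = j < i <= a + 1      # backward edge in the first block
            back_high = n - a <= j < i     # backward edge in the last block
            if forward or back_low or back_high:
                edges.add((i, j))
    return edges


def degree_sequences(edges, n):
    """Return the sorted outdegree and indegree sequences of the digraph."""
    out_deg = [0] * (n + 1)
    in_deg = [0] * (n + 1)
    for i, j in edges:
        out_deg[i] += 1
        in_deg[j] += 1
    return sorted(out_deg[1:]), sorted(in_deg[1:])


# Small instance (assumed parameters): n = 20 and a = alpha * n = 4.
n, a = 20, 4
out_seq, in_seq = degree_sequences(example_digraph(n, a), n)
min_semidegree = min(out_seq[0], in_seq[0])
# With 0-based indexing, position i holds d_{i+1}, so d_{i+1} >= i is the claim.
degree_condition = all(out_seq[i] >= i and in_seq[i] >= i for i in range(n))
```

For these parameters the minimum semidegree equals `a` and the condition $d_i^+, d_i^- \ge i - 1$ holds for every i, matching the statement in the text.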